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Summary  of  Main  Results 


In  Code-Division  Multiple-Access  (CDMA),  each  transmitter  is  assigned  a  fixed,  dis¬ 
tinct  signature  waveform  which  he  uses  to  modulate  his  message  in  the  same  fashion  as 
in  single-user  communication.  Then  the  information  sent  by  each  user  can  be  demodu¬ 
lated  by  correlating  the  received  signal  with  each  of  the  signature  waveforms.  This  demo¬ 
dulator,  whose  use  is  widespread  in  practice,  is  referred  to  as  the  conventional  single-user 
detector.  As  is  v>  11-known,  when  the  channel  output  is  corrupted  by  additive  white 
Gaussian  noise,  the  conventional  single-user  detector  minimizes  the  probability  of  eiror  in 
a  single-user  channel,  i.e.,  in  the  absence  of  interfering  users.  The  fact  that  this  is  no 
longer  true  in  the  multiple-access  channel  is  the  raison  d’etre  of  the  area  of  multiuser 
detection. 

The  performance  of  the  conventional  single-user  dt  actor  is  acceptable  provided  that 
the  energies  of  the  received  signals  are  not  too  dissimilar  and  that  the  signature 
waveforms  are  designed  so  that  their  crosscorrelations  are  low  enough  (this  depends  on 
the  desired  maximum  number  of  simultaneous  users).  In  practice,  low  crosscorrelations 
are  usually  achieved  employing  Spread-Spectrum  Pseudonoise  sequences  of  long  periodi¬ 
city.  If  the  received  signal  energies  are  indeed  dissimilar,  i.e.,  some  users  are  very  weak  in 
comparison  to  others,  then  the  conventional  single-user  detector  is  unable  to  recover  the 
messages  of  the  weak  users  reliably,  even  if  the  signature  waveforms  have  very  low 
crosscorrelations.  This  is  known  as  the  near-far  problem  and  is  the  main  shortcoming  of 
currently  operational  Direct-Sequence  Spread-Spectrum  Multiple-Access  systems,  and  of 
recently  proposed  systems  for  future  mobile  radio  communications. 

Due  to  the  reduction  of  multiple- access  capability  and  the  increase  of  vulnerability 
to  hostile  sources  caused  by  the  near-far  problem,  its  solution  or  alleviation  had  been  a 
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target  of  researchers  in  the  area  for  several  years.  Before  the  emergence  of  the  solutions 
based  on  multiuser  detection  developed  by  the  Principal  Investigator  and  his  coworkers, 
success  had  been  very  limited  and,  essentially,  the  only  remedies  available  were  power 
control  and  the  design  of  signals  with  even  more  stringent  crosscorrelation  properties. 
Unfortunately,  power  control  (i.e.,  the  adaptive  adjustment  of  transmitter  power  depend¬ 
ing  on  its  location  and  of  the  received  powers  of  the  other  users)  dictates  significant 
reductions  in  the  transmitted  powers  of  the  strong  users  in  order  for  the  weaker  users  to 
achieve  reliable  communication.  Thus,  power  control  can  become  self-defeating  since  it 
actually  decreases  the  overall  multiple-access  and  antijamming  capabilities  of  the  system. 
Furthermore,  more  and  mor  complex  signature  waveforms  lead  to  rapid  increases  in 
system  cost  and  bandwidth,  and,  as  we  have  noted,  do  not  eliminate  the  near-far  prob¬ 
lem.  For  these  reasons,  it  can  be  seen  why  the  practical  solution  to  the  near-far  problem 
achieved  in  this  project  can  be  objectively  considered  a  major  breakthrough  in  the  appli¬ 
cation  of  signal  processing  techniques  to  Spread  Spectrum  communications. 

The  chief  reason  why  multiuser  detection  did  not  develop  until  relatively  recently 
was  the  belief  shared  by  many  a  worker  in  Spread-Spectrum  that  multiuser  interference 
is  accurately  modeled  as  a  white  Gaussian  random  process,  and  thus  the  conventional 
detector  is  essentially  optimum.  It  is  not  difficult  to  build  an  infinite  population  mul¬ 
tiuser  signal  model  which  can  be  rigorously  shown  to  be  asymptotically  Gaussian  as  the 
individual  amplitudes  go  to  zero  with  the  appropriate  speed.  Unfortunately,  the  number 
of  transmitters,  signature  waveforms,  and  power  levels  encountered  in  many  practical 
situations  (e.g.  in  near-far  environments)  render  the  Gaussian  approximation  completely 
useless.  Therefore,  it  is  useful  to  adopt  a  more  refined  viewpoint  by  taking  the  realities 
of  the  medium  into  account,  modeling  them  and  exploiting  them. 

Prior  to  the  start  of  this  project,  it  had  been  shown  by  the  Principal  Investigator  in 
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"Minimum  Probability  of  Error  for  Asynchronous  Gaussian  Multiple  Access  Channels," 
IEEE  Trans.  Information  Theory,  Jan.  1986  and  "Optimum  Multiuser  Asymptotic 
Efficient,"  IEEE  Trans.  Communications,  Sep.  1986,  that  the  near-far  problem  is  not  an 
inherent  flaw  of  Direct-Sequence  Spread-Spectrum  systems  but  rather  of  the  simple  con¬ 
ventional  correlation  receiver.  Those  works  developed  optimum  multiuser  receivers, 
which  consist  of  a  front  end  of  matched  filters  followed  by  a  Viterbi-type  algorithm 
whose  number  of  states  is  exponential  in  the  number  of  users.  Those  receivers  did  not 
suffer  from  the  near-far  problem.  However,  their  implementation  suffered  from  the  bur¬ 
den  of  the  exponential  complexity  in  the  number  of  interferers  and  from  the  necessity  to 
estimate  the  received  powers  of  each  interferer. 

The  first  breakthrough  obtained  in  this  project  was  the  demonstration  that  for 
symbol-synchronous  multiple-access  channels  a  simple  modification  of  the  conventional 
correlation  receiver  results  in  a  system  which  does  not  exhibit  the  near-far  problem,  and 
moreover  provides  optimum  robustness  against  variations  in  the  received  strength  of  the 
various  transmissions.  We  called  this  new  receiver  the  decorrelating  detector  as  it  corre¬ 
lates  the  received  signal  against  a  linear  combination  of  the  waveforms  assigned  to  the 
active  users,  rather  than  only  the  waveform  of  the  user  of  interest.  The  coefficients  of 
such  a  linear  combination  do  not  depend  on  the  relative  strengths  of  the  transmitters 
and  can  be  precomputed  in  advance.  Such  linear  combination  is  such  that  the  detector 
correlates  with  the  projection  of  the  signature  waveform  of  the  user  of  interest  on  the 
subspace  orthogonal  to  the  subspace  spanned  by  the  interfering  waveforms.  Thus,  the 
decorrelating  detector  effectively  tunes  out  the  multiuser  interference. 

A  very  pleasant  surprise  was  to  demonstrate  that  the  decorrelating  detector 
achieves  optimum  near-far  resistance,  i.e.  the  same  level  of  prote*. Aon  against  the  near-far 
problem  as  the  optimum  detector.  This  means  that  knowledge  of  the  received  energies  is 
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not  required  to  combat  the  near-far  problem  and  that  a  receiver  whose  complexity  is 
similar  to  that  of  the  conventional  detector  achieves  the  same  degree  of  robustness 
against  imbalances  in  the  received  energies  as  the  optimum  detector,  with  its  exponential 
complexity.  Another  attractive  property  of  the  decorrelating  detector  is  that  its  bit¬ 
error-rate  is  independent  of  the  energies  of  the  interfering  transmitters--a  most  desirable 
feature  of  a  strategy  designed  to  combat  the  near-far  problem.  The  signal- to- noise 
reduction  due  to  the  presence  of  interfering  waveforms  depends  exclusively  on  the 
(crosscorrelations  of  the)  signature  waveforms  assigned  to  the  transmitters,  and  not  on 
their  relative  power  levels. 

The  next  major  result  was  the  development  of  the  decorrelating  detector  for  the 
asynchronous  Code-Division  multiple- access  channel.  In  contrast  to  the  solution 
obtained  for  the  synchronous  channel,  the  asynchronous  decorrelating  detector  is  a  linear 
system  with  memory  which  can  be  implemented  by  transversal  discrete-time  filters. 
Although  the  technical  development  and  bit-error-rate  analysis  is  complicated  by  an 
order  of  magnitude  in  the  asynchronous  case,  all  the  desirable  features  of  the  synchro¬ 
nous  decorrelating  detector  are  retained:  optimal  near-far  resistance,  independence  of 
bit-error-rate  to  relative  power  levels,  low  complexity,  and  the  fact  that  it  is  unnecessary 
to  know  the  received  energies  of  the  individual  interferers  at  the  receiver. 

The  main  increase  in  complexity  of  the  decorrelating  detector  with  respect  to  that 
of  the  conventional  matched  filter  is  the  need  to  lock  to  the  signal  epochs  of  the  interfer¬ 
ing  users,  which  can  be  accomplished  by  a  bank  of  conventional  single-ue&r  synchroniza¬ 
tion  systems,  or  (in  a  promising  direction  for  future  research)  by  multiuser  synchronizers. 

We  have  also  developed  low-complexity  approximations  to  the  decorrelating  detec¬ 
tor,  which  do  not  require  the  iterative  or  off-line  computation  of  equalizer  taps  and  per¬ 
form  promisingly  for  signature  waveforms  such  as  those  used  in  currently  operational 
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Direct-Sequence  Spread-Spectrum  systems. 

Although  the  decorrelating  detector  does  not  require  knowledge  of  the  powers  of  the 
interfering  transmitters,  we  have  found  a  class  of  related  linear  receivers  which  makes 
effective  use  of  this  information.  Consequently,  we  have  tackled  the  problem  of  ampli¬ 
tude  estimation  in  a  multiuser  white  Gaussian  channel  and  have  suggested  promising 
techniques  that  can  be  brought  to  bear  on  this  problem.  Those  techniques  can  be 
categorized  as  being  based  either  on  maximum-likelihood  data  recovery  or  on  minimum 
error-probability  (i.e.,  Bayesian)  data  recovery.  In  each  case,  the  most  promising  algo¬ 
rithm  is  iterative  with  Gauss-Seidel  iteration  and  the  EM  algorithm  being  proposed  for 
the  ML  and  MEP  approaches,  respectively. 

We  have  also  considered  nonlinear  receivers  for  multiuser  channels.  Here,  each 
rece’ver  may  only  know  the  signature  sequence  of  one  of  the  transmitters  and  treats  the 
sum  of  all  the  other  transmitted  waveforms  as  noise,  which  is  neither  white  nor  Gaus¬ 
sian.  Particular  emphasis  is  placed  on  asymptotically  optimum  detectors  for  each  of  the 
following  situations;  weak  interferes;  CDMA  signature  waveforms  with  long  spreading 
codes;  and  low  background  Gaussian  noise  level. 

We  have  also  looked  at  issues  of  computational  complexity  of  combinatorial  optimi¬ 
zation  problems  arising  in  multiuser  detection.  We  have  shown  that  minimum  bit-error- 
rate  demodulation  of  multiuser  signals  is  an  NP-complete  problem  in  the  number  of 
users,  and  hence  polynomial  algorithms  are  out  of  the  question  unless  well-known  prob¬ 
lems  such  as  the  traveling-salesman  and  integer  linear  programming  can  also  be  solved  in 
polynomial  time.  Fortunately,  however,  minimum  bit-error-rate  receivers  are  not  the 
only  ones  that  are  resistant  against  the  near-far  problem.  In  particular,  when,  in  lieu  of 
bit-error-rate,  near-far  resistance  is  the  optimality  criterion,  then  the  complexity  of  the 
optimal  receiver  (the  de  -orrelating  detector)  is  linear  in  the  number  of  interferers. 


-  6  - 


When  Spread-Spectrum  signaling  is  coupled  to  random-access  protocols  to  provide 
some  degree  of  Sow  control,  it  is  necessary  to  do  away  with  the  conventional  collision 
channel  model,  whereby  several  simultaneous  transmissions  result  in  the  destruction  of 
all  transmitted  messages.  We  have  proposed  a  new  model,  the  multipacket  channel, 
which  is  general  enough  to  encompass  random-access  channels  with  CDMA  or  with  cap¬ 
ture  (which  are  especially  relevant  in  near-far  situations).  More  significantly,  we  have 
found  the  maximum  throughput  achievable  by  the  ALOHA  algorithm  in  the  general  mul¬ 
tipacket  channel:  1)  in  the  open-loop  version  of  the  ALOHA  algorithm,  i.e.,  with  fixed 
retransmission  probabilities,  the  throughput  is  equal  to  the  limit  of  the  expected  number 
of  successfully  received  packets  per  slot  as  the  backlog  goes  to  infinity,  and  2)  the 
throughput  of  closed-loop  ALOHA,  wher.  he  retransmission  probabilities  are  a  function 
of  the  channel  outcomes,  is  equal  to  the  maximum  over  v  of  the  expected  number  of  suc¬ 
cessfully  received  packets  per  slot  when  the  number  of  attempted  transmissions  is  a  Pois¬ 
son  random  variable  with  mean  v. 

Our  research  efforts  under  the  sponsorship  of  this  ARO  contract  have  nlso  been 
directed  to  the  investigation  of  related  issues  in  the  CDMA  optical  channe  vhich  is 
receiving  considerable  attention  both  in  commercial  and  military  application^  for  fiber¬ 
optic  and  free-space  photonic  channels.  The  emphasis  has  been  in  the  development  of 
formulas  for  the  bit-error-rate  of  simple  single-user  receivers,  which  can  be  computed 
efficiently  lor  large  numbers  of  users.  A  by-product  of  this  analysis  is  the  determination 
of  the  optimum  detection  threshold  as  a  function  of  the  number  of  users--a  problem  that 
finds  no  counterpart  in  the  conventional  Caussian  channel.  We  have  achieved  the  first 
exact  analysis  of  a  single-user  optical  matched  filter  detector  in  the  presence  of  an  arbi¬ 
trary  number  of  asynchronous  transmitters.  The  comparison  of  our  exact  analysis  with 
popular  approximations  on  user  synchronism  or  on  the  distribution  of  the  multiple-access 
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interference  points  to  severe  shortcomings  of  those  approximations,  as  the  optimal  thres¬ 
hold  is  consistently  underestimated,  leading  to  an  error  probability  which  is  an  order  of 
magnitude  above  the  one  that  can  be  obtained  by  a  simple  adjustment  of  the  threshold 
as  a  function  of  the  number  of  active  transmitters. 
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Abstract — The  stability  of  the  Aloha  random  access  algorithm  in  an 
infinite-user  slotted  channel  with  muliipacket  reception  capability  is 
considered.  This  channel  is  a  generalization  of  the  usual  collision  channel, 
in  that  it  allows  the  correct  reception  of  one  or  more  packets  involved  in  a 
collision.  The  number  of  successfully  received  packets  in  each  slot  is 
modeled  as  a  random  variable  which  depends  exclusively  on  the  number 
of  simultaneous  attempted  transmissions.  This  general  model  includes  as 
special  cases  channels  with  capture,  noise,  and  code  division  multiplexing. 
It  is  shown  by  means  of  drift  analysis  that  the  channel  backlog  Markov 
chain  is  ergodic  if  the  packet  arrival  rate  is  less  than  the  expected  number 
of  packets  successfully  received  in  a  collision  of  n  as  n  goes  to  infinity. 
Finally,  the  properties  of  the  backlog  in  the  nonergodicity  region  are 
examined. 


I.  Introduction 

ONE  of  the  main  problems  in  random  access  communications 
is  the  determination  of  the  maximum  stable  throughput.  In 
particular,  an  important  result  is  that  the  Aloha  protocol  is 
unstable  [l]-[3]  in  an  infinite-user  slotted  collision  channel 
where  a  transmission  is  successful  only  if  no  other  users  attempt 
transmissions  simultaneously.  Several  strategies  have  been  de¬ 
signed  to  stabilize  this  channel,  such  as  collision  resolution 
algorithms  (see  [4],  for  example)  where  transmissions  are 
deferred  until  the  current  conflict  is  solved,  and  more  recently, 
Aloha-type  strategies  using  decentralized  control,  where  the 
retransmission  probability  is  updated  according  to  previous 
channel  outcomes.  It  has  been  shown  [5]-[7]  that  the  maximum 
stable  throughput  achievable  by  such  Aloha-type  strategies  with 
decentralized  control  is  e~'. 

However,  the  collision  channel  model  does  not  hold  in  many 
important  practical  multiuser  communication  systems  [8]-[21] 
because  simultaneous  transmission  of  several  packets  does  not 
necessarily  result  in  the  destruction  of  all  the  transmitted 
information.  For  instance,  the  capture  phenomenon  is  common  in 
local  area  radio  networks  [12]-[15];  if  the  power  of  one  of  the 
received  packets  is  sufficiently  large  compared  to  the  power  of  the 
other  packets  involved  in  a  collision,  then  the  strongest  packet  can 
be  correctly  decoded,  while  the  other  packets  are  lost.  Other 
examples  are  multiple-access  channels  where  several  users 
transmit  simultaneously  in  the  same  frequency  band,  and  a 
multiuser  detector  demodulates  the  information  transmitted  by  all 
active  users  (e.g.,  [8]— [1 1]).  Although  those  systems  do  not 
necessarily  require  a  random  access  protocol,  it  is  sometimes 
useful  to  exercise  some  flow  control  through  such  a  protocol  so  as 
to  limit  the  maximum  number  of  simultaneous  transmitters,  in 
order  to  bound  the  multiuser  receiver  complexity  and  guarantee 
lower  bit-error  rates. 
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Previous  studies  of  some  of  the  aforementioned  systems  [9], 
[12]— [18]  where  some  of  the  packets  involved  in  a  collision  may 
be  correctly  received  have  shown  that  the  performances  are 
noticeably  improved  with  respect  to  slotted  Aloha.  However,  even 
in  those  special  cases,  no  precise  stability  result  is  available,  either 
because  finite  population  networks  with  no  buffer  space  were 
considered,  or  because  the  Poisson  approximation  of  channel 
traffic  was  used  for  infinite  population  networks.  In  [19]  (see  also 
[20]),  upper  and  lower  bounds  are  derived  for  the  capacity  of  a 
multiple  access  channel  where  all  packets  are  correctly  received  if 
the  collision  size  does  not  exceed  a  fixed  threshold  and  otherwise 
all  packets  are  destroyed. 

In  this  paper,  we  consider  a  generalization  of  the  collision 
channel,  where  the  receiver  can  demodulate  several  packets 
simultaneously.  It  is  assumed  that  the  number  of  correcdy 
demodulated  packets  is  a  random  variable,  which,  given  the 
number  of  packets  simultaneously  transmitted,  is  independent  of 
the  backlog  and  of  the  number  of  previous  retransmission 
attempts.  This  random  variable  can  take  any  integer  value 
between  zero  and  th  collision  size.  Thus,  the  channel  is  described 
by  a  matrix  of  conditional  probabilities  («„*)  where  tnk  is  the 
probability  that  k  packets  are  correctly  demodulated  given  that 
there  were  n  simultaneous  transmissions.  We  analyze  the  usual 
Aloha  algorithm  with  the  multipacket  reception  capability  just 
described.  Users  are  synchronized  so  that  transmissions  take  place 
within  one  slot,  and  at  the  end  of  each  slot,  stations  that  did 
transmit  a  packet  learn  whether  or  not  their  transmission  was 
successful  Unsuccessful  or  backlogged  packets  are  retransmitted 
in  each  subsequent  slot  with  probability  p,0  <  p  <  1 .  It  turns  out 
that  multipacket  reception  capability  can  stabilize  Aloha.  Our 
main  result  states  that  the  maximum  stable  throughput  is  equal  to 
the  limit  of  the  average  number  of  packets  correctly  received  in 
collisions  of  size  n  when  n  goes  to  infinity.  To  show  this,  we 
model  the  channel  backlog  as  a  Markov  chain,  and  then  study  its 
properties  by  using  some  simple  drift  analysis  techniques. 

The  last  part  of  this  paper  is  a  study  of  the  properties  of  the 
backlog  in  the  nonergodicity  region.  Unlike  the  backlog  Markov 
chain  for  slotted  Aloha  which  is  always  transient  [1],  the  backlog 
for  our  model  does  in  general  have  a  null  recurrence  region  of 
positive  length,  which  depends  on  the  matrix  (e„k)  and  on  the 
retransmission  probability  p.  However,  transience  in  the  nonergo¬ 
dicity  region  can  be  ensured  for  a  large  class  of  systems,  and  in 
particular  for  channels  where  the  number  of  successful  simultane¬ 
ous  transmissions  is  bounded. 

II.  Multipacket  Reception  Model 

Let  Ak  be  the  number  of  new  packets  arriving  during  time  slot 
k.  Assume  that  (,Ak)k7.0  are  i.i.d.  random  variables  with 
probability  distribution: 

P[Ak=n)  =  \n  (ua  0) 

such  that  the  mean  arrival  rate  X  =  S"m|  n\„  is  finite.  New 
packets  are  transmitted  with  probability  one  at  the  beginning  of 
the  first  slot  following  their  arrival. 

Given  that  n  packets  are  being  transmitted  in  one  slot,  we  define 
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for  n  a  1,  0  £  k  <.  n 

tnk  =  P[k  packets  are  correctly  receded  |  n  are  transmitted]. 

The  multipacket  reception  properties  of  the  channel  are  summa¬ 
rized  by  the  stochastic  matrix 
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which  we  refer  to  as  the  reception  matrix  of  the  channel.  For 
instance,  the  reception  matrix  for  the  usual  collision  channel  is 

*0  1 
1  0 
1  0 
1  . 


while  for  a  system  with  capture  it  has  the  form 

’  0  1  ’ 
1  *2  0 

I-*,  X„ 


where  x„  is  the  probability  of  capture  given  that  the  collision  size 
is  n.  The  model  studied  in  [19],  [20]  can  be  described  by  a 
reception  matrix  of  the  form 

“0  1 

0  0  1  0 

0  0  1  . 

1  0 

1  0  0 


Note  that  by  letting  t^iO  our  model  allows  not  only  collisions 
but  also  background  noise  to  be  a  source  of  errors. 

Denote  by  X„  the  number  of  backlogged  packets  in  the  system 
at  the  beginning  of  slot  n.  The  discrete-time  process  (X„/nz0  is 
easily  seen  to  be  a  homogeneous  Markov  chain.  We  define  the 
system  to  be  stable  if  (X„)„£0  is  ergodic  and  unstable  otherwise. 
The  average  number  of  packets  correctly  received  in  collisions  of 
size  n  is  denoted  by  Cn  =  SJ.,  kenk.  We  can  now  state  the  mam 
result. 

Theorem  l:  If  C„  has  a  limit  C  =  limfl_«  C„,  then1  the  system 
is  stable  for  all  arrival  distributions  such  that  X  <  C  and  is 
unstable  for  X  >  C.  This  also  holds  if  C  is  infinite,  iflim,,..*,  C„ 
=  +  oo,  then  the  system  is  always  stable. 

The  proof  is  given  in  Section  III.  In  the  remainder  of  this 
section,  we  use  Theorem  1  to  analyze  several  simple  random 
access  channels  that  fall  within  the  scope  of  the  multipacket 
reception  channel. 

l)  Mobile  Users  with  Pairwise  Transmissions.  Consider  an 
infinite  number  of  transmitters  Tu  Ti,  •••,  and  an  infinite 
number  of  receivers  Rit  Rz,  •  ■ ,  whose  positions  in  the  plane  are 
i  i  d  random  variables.  Suppose  that  transmissions  are  pairwise 

’Tills  result  holds  under  the  assumption  that  the  Markov  chain  of  the 
number  of  backlogged  packets  is  irreduciv  nd  aperiodic  (for  details  and 
sufficient  conditions,  see  Section  111). 


A 


A  ;  TRANSMITTER 
Q  :  RECEIVER 

Fig.  1.  Pairwise  transmissions  with  only  one  success  (3-3). 


in  the  sense  that  transmitter  Tn  sends  packets  only  to  receiver  Rn, 
and  Rn  is  only  interested  in  the  packets  sent  by  T„  (see  Fig.  1). 
Assume  also  that  each  receiver  can  only  detect  correctly  the 
packet  sent  by  the  closest  transmitter  (in  particular,  this  is  the  case 
if  there  is  perfect  capture,  see  Example  3  below).  The  successes  of 
transmissions  occurring  at  the  same  time  are  independent,  so  that 
for  n  2:  2 


p(n)kV-p{,n)y-k 

where  p(n)  is  the  probability  that  any  given  transmitter  is 
successful  in  a  collision  of  size  n,  which  is  equal  to  l/n  if  we 
assume  that  all  locations  are  memoryless,  i.e.,  independent  from 
slot  to  slot.  It  follows  that 

C„  =  np(n)=  1 

and  the  maximum  throughput  is  1 .  More  generally,  if  because  of 
channel  noise,  the  message  of  the  closest  transmitter  is  received 
correctly  with  probability  a  (in  other  words  elt  =  a),  then  the 
throughput  is  equal  to  a.  The  assumption  that  the  locations  of  the 
stations  are  memoryless  is  equivalent  to  assuming  that  they  move 
infinitely  fast  If  this  simplify  ing  assumption  is  dropped,  then  the 
number  of  successes  depends  not  only  on  the  current  number  of 
retransmissions,  but  also  on  the  previous  history  of  retransmis¬ 
sions,  and  thus  the  problem  is  no  longer  encompassed  by  our 
multipacket  reception  model.  In  Fig.  2,  the  result  of  a  simulation 
shows  that  for  moderate  speeds,  the  actual  throughput  is  well 
approximated  by  the  foregoing  analysis. 

2)  Frequency  Hopping  Random  Access  Channel:  Consider  a 
finite  population  of  N  users  transmitting  by  frequency  hopping,  as 
in  [11],  [22].  For  each  packet  he  wants  to  transmit,  a  user  selects 
with  equal  probability  one  frequency  in  a  fixed  set  of  q 
frequencies.  A  packet  is  correctly  received  iff  no  other  packet  is 
transmitted  on  the  same  frequency  during  the  same  siot.  We 
compute  (fink)\ik£N,  and  C  =  lim*-®  CN.  If  the  users  have 
infinite  buffer  space,  then  C  can  be  taken  as  a  good  approximation 
for  large  N  of  the  maximum  stable  throughput  of  the  system, 
which  is  unknown.  If  the  users  have  no  buffer  space,  as  is  often 
assumed,  the  back.- Jg  Markov  chain  is  always  ergodic,  but  even 
then,  one  should  expect  reasonable  delays  in  large  population 
problems  only  for  arrival  rates  below  C.  The  computation  of  the 
reception  matrix  of  this  channel  is  a  simple  combinatorial  problem 
of  random  assignment  of  objects  to  cells  (e.g.,  see  [23,  App.  A]), 
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P[S={TU  T2,  •••,  Tj}] 


and  the  following  decomposition: 

Tit  •••,  7}}  c  5]  =  P[5={r„  7-j,  •••,  7}}] 


U  {{7-„  c5} 


easily  yields  the  desired  expression 


P[S={TU  T2,  7)}1=2  (-0* 

Jt-0 


•  g(g~0  "•  (Q-j~k+l)(q-j-k)N-J'k 

q n  w 

for  1  <  j  <  N.  Notice  in  particular  that  €n,n-  i  =  0.  Let  us  now 
compute  the  average  number  of  packets  correctly  received  in 
collisions  of  size  N,  C,v  =  ,  ye/yy-  By  using  (4)  and  summing  at 


j  +  k  constant,  we  get 

N 

4NCN='%  q(q-l)  •••  (<7-/+ 1)07-0"'' 


2  (-l)n 


nl(i-n-l)<(N~i)\ 


0  5  10  15  20  25  30 

VELOCITY 

Fig.  2.  Throughput  as  a  function  of  velocity  for  mobile  users  with  pairwise 
transmissions.  Stations  moving  in  a  square  region:  velocity  units:  percent¬ 
age  of  square  side  traveled  in  one  slot.  Retransmission  probability  set  to 
0.1. 


Denote  by  Tu  T2,  •••,  TN  the  users,  all  involved  in  the  collision, 
and  also  denote  by  5  the  set  of  users  whose  packets  are  correctly 
received.  Two  cases  need  to  be  considered. 
a)  2  N  <  q:  We  have,  for  1  <  j  <  N 


which  can  be  simplified  as 

qNc^i-mN-iy. 

•  t?(t7—l)  •••  07-/+l)07-/)"-'(l-l)‘-> 

to  get  the  final  result 


Cn=N  1  — 


1  \tf-i 


b)  N  >  q:  In  this  case,  there  can  be  at  best  q  -  1  successes 
in  a  collision  of  size  N.  The  same  method  applies  to  get  the 
following  probabilities: 


N\  ( N-j 


9(9-1)  •••  (q-j-k+[)(q-j-k)N-J-k 


(ls/<9-l) 


eNJ=0  (q^j<N) 

resulting  in  the  same  expected  number  of  successes  as  before 


•  (^V)Wk  t*  *".WSS)  (2) 

where  only  one  term  is  left  to  compute 

P[{T>,  T2,  cS] 

9(9-1)  (q-j~k+\)(q-j-k)N-J-k 

qN 

for  1  <  j  <  N,  0  <  k  £  N  -  j.  Putting  (l),  (2),  and  (3)  together 
gives  the  result 


Now  we  let  the  population  size  N  go  to  infinity  and  we  apply 
our  result.  If  we  let  N  grow  to  infinity  while  keeping  q  constant, 
we  have  lim/v-»  CN  =  0,  so  the  system  is  always  unstable.  On  the 
other  hand,  if  we  let  N  go  to  infinity  while  keeping  q  equal  to  a 
fixed  percentage  of  the  population  size,  i.e.,  N/q  constant,  then 
lim^-oo  Cn  =  +oo,  and  the  system  is  always  stable.  It  is  easily 
•shown  that  to  get  a  finite  maximum  stable  throughput,  q  has  to 
grow  as  M In  N. 

3)  Mobile  Radio  Network  with  Capture:  Consider  an  infinite 
number  of  Users  independently  and  uniformly  distributed  in  a 
circle  of  radius  R,  whose  positions  are  independent  from  slot  to 
slot.  Users  transmit  packets  to  a  common  receiver  located  at  the 
center  of  the  network.  Denote  by  P\  and  P2  the  received  powers 
of  the  strongest  and  the  next  to  strongest  packets  involved  in  a 
collision.  Assume,  as  in  [12]~[14],  that  the  strongest  packet  is 
correctly  received  iff  P\/P2  >  K  (K  being  a  system  dependent 
constant),  and  that  all  the  other  packets  involved  in  the  collision 
are  not  received  successfully.  Assume,  moreover,  that  the 
received  power  of  a  packet  only  depends  on  the  distance  r  between 
the  sender  and  the  receiver 


constant 


(a  a  2). 


Then  there  will  be  capture  iff 


rr>0r, 

where  /?  =  KUa  is  the  capture  parameter,  and  rt,  r2  are  the 
distances  of  the  closest  and  the  next  to  closest  senders  from  the 
receiver. 
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Denote  by  D  the  distance  between  a  given  user  and  the  and  for  /  >  1 
receiver.  It  is  easily  shown  that  the  pdf  of  D  is  given  by 


Po(r)  =  2—1  (0  Si-zR). 


n*  0  Ji-* 


Given  /V users,  denote  by  UN  the  closest  from  the  receiver,  and  by  [*  '  1  ®  ^ 

Dn  its  distance  from  die  receiver.  Computing  the  cdf  of  DN  and  Pa=  BfO)  +  2j  Bt(j)t)Q  +  ^  2/  Bt(j)en 

taidng  its  derivative,  we  obtain  *-  f*1  J  n-i  /-o 

PDN(r)*.2N^i  N  '  iO*r<;R).  (5)  Ptj*k-  S  Xk+"  £  BiU)ej+k+n.n  (kzl). 

R  L  \R/  J  nm0  J~° 


PDN{r)*.2N-jp  I  1-^J  J  '  iO£rsR).  (5)  p«+*  =  £  Xk+"  £  B‘U)ej+k+n.n  (kzl).  (8) 

Given  DN=r,  the  other  N  -  1  users  are  uniformly  distributed  in  Sufficient  conditions  for  (X „)„*<,  to  be  irreducible  and  apen- 
thc  annular  region  (r,  R).  So  if /V  users  collide  and  DN  =  r,  Us  0(110  are  as  follows: 
will  be  correctly  received  iff  all  the  other  users  are  in  the  annular  «  if  0  <  p  <  1 : 
region  {fir,  R),  which  is  empty  if  fir  >  R.  Therefore,  if  we  denote 

by’  Xo*0  (9a) 


f>iv(r)  =  P(capture|Ar collide,  D^=r]  ((Va2) 


we  have 


(  fl2-/?2/-2!"-'  R 

l  — r~r  -----  if  rsS- 

R2-r>  J  fi 


■'  R 

,,r£? 


Thus,  the  probability  of  capture  in  a  collision  of  N  (N  >  2)  is 


[■a/a 

e/vi  =  Jo  Pv(r)pDN{r)  dr. 


Bi{j)  =  [  ■  P'd-P)'-' 


PoO  =  'hJr  ^nknit 


Pok=^/  "kk+ntk+n.n  1) 


Xo+  2/  1 


e  if  p  =  1: 


Using  (5)  and  (6),  and  with  the  change  of  variable  X  -  fi/R,  this 
is  easily  computed 

(l/S  i 

«AM=\  2Mr(l dx~—, . 
jo  j32 

It  follows  that  C  »  l/fi2  is  the  maximum  stable  throughput. 
Notice,  in  particular,  that  for/3  =  1  (perfect  capture),  we  have  C 
=  1  and  for  fi  —  co  (no  capture),  we  have  C  -»  0. 

Under  cenain  conditions,  the  performances  of  Aloha  in  the 
multipacket  channel  can  be  improved  by  varying  the  retransmis¬ 
sion  probability  as  a  function  of  the  channel  history,  and  a 
maximum  stable  throughput  of  sup.tJ.o  C„/n\x!'  can  be 

reached  (see  [31]). 

III.  Ergodicity  Region 

The  number  of  backlogged  packets  in  the  system  at  time  n, 
(X„)ni.o,  is  a  homogeneous  Markov  chain  whose  one-step 
transition  probability  matrix  can  be  computed  as  a  function  of  p, 
(X*)*ao>  and  E-  Denoting  by  B,{j)  the  probability  of  having  j 
retransmissions  out  of  /  backlogged  packets 


k  we  get 


Xo+  ^  kn£nn<  1 


for  all  /a  1,  £/07t  1.  (9d) 

These  are  only  sufficient  conditions,  but  they  hold  for  almost  all 
nontrivial  systems.  For  example,  if  (9b)  does  not  hold,  then  zero 
is  an  absorbing  state,  since  the  left-hand  side  of  (9b)  is  equal  to 
Poo •  Also,  (9c)  simply  means  that  the  successful  reception  of  a 
single  packet  in  the  absence  of  other  active  users  is  possible. 
Assume,  for  instance,  that  0  <  p  <  1  and  that  the  arrivals  are 
Poisson  distributed.  Then  we  only  have  to  assume  (9c),  and  (9b)  is 
true  unless  there  is  perfect  reception,  that  is  e,M  =  1  for  all  n  >  1, 
in  which  case  the  system  would  of  course  always  be  stable.  The 
case  p  =  1  gives  rise  to  a  number  of  pathological  situations, 
hence,  the  much  stronger  condition  (9d).  It  generally  turns  out 
that  either  (9d)  is  not  necessary  or  the  stability  region  of  the 
system  is  obvious.  For  instance,  it  is  clear  from  the  transition 
probabilities  that  slotted  Aloha  with  p  =  1  is  always  unstable.  In 
any  case,  it  is  assumed  in  what  follows  that  ( X is  irreducible 
and  aperiodic. 

Proof  of  Theorem  1:  The  proof  is  based  on  drift  analysis. 
Recall  that  in  general,  the  drift  at  state  /  (/  >  0)  is  defined  by 

di= E[X,+\-  X,\X,  =  i). 

If  we  denote  by  E,  the  number  of  successful  transmissions  in  slot 
t,  we  have 


*„,-*,  =  A, -S, 


and  therefore 


4=x-£[S,pr,=/].  do) 

Now  if  R,  is  the  number  of  retransmissions  in  slot !,  we  get 
P[S,=k\X,=i,  A,=n,  R,=j]-t„<.,.k 
for  0  s  j  £  i,  0  <  k  £  n  +  yand  with  the  convention  that  £W  = 
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C0  =  0.  Thus, 

E[Si\Xi  =  i,  At  =  n,  Ri—j]  =  Cnt.j 

and 

00  l 

E(l,\X,=i)^  2  X„  2  Bi(j)Cn+J.  (11) 

n- 0  7-0 

The  value  of  the  drifts  for  our  model  follows  from  (10)  and  (11) 

4  =  A- 2  BAJ)C»r  (12) 

0 

The  idea  of  the  proof  is  to  compute  limj-.„  d,  which  will  turn  out 
to  be  a  very  simple  expression,  and  then  apply  the  results  of  [3] 
and  [24]  to  determine  the  ergodicity  region  of  (Xn)n^0-  Let  us  first 
recall  the  two  results  that  will  be  used  in  the  sequel. 

Lemma  A  (Pokes  (24)):  Let  (X„)ni:0  be  an  irreducible  and 
aperiodic  Markov  chain  having  as  state  space  the  nonnegative 
integers,  denote  by  ( Py )  its  transition  probability  matrix,  and  by 
di  its  drift  at  state  i.  Then  if  for  all  /  \d\  <  <»,  and  if  lim  sup;„„  tf, 
<  0,  (Xn)„zo  is  ergodic. 

Lemma  B  (Kaplan  [3]):  Under  the  assumptions  of  Lemma  A, 
if  for  some  integer  /*/>0  and  some  constants  B  >  0,  c  6  (0,  1] 
the  following  two  conditions  hold,  then  (Xn)n£o  is  not  ergodic: 

i)  for  all  /  >  N,  d,  >  0 

ii)  for  all  i  >  N,  all  6  €  [c,  1],  0l  -  S ,Pti6'  2:  -B(  1  -6). 
From  (12),  it  can  be  seen  that  |<f,|  is  finite  since 

|4|<X+f;  X„  J  B,U)Cn+J*2\*ip. 

n» 0  y-0 

Next,  the  drift  limit  is  given  by  the  following  lemma. 

Lemma  1:  If  C„  has  a  limit  C,  finite  or  not,  then  lint,-,. 
X„£y.o  B,(j  )C„+J  —  C. 

Proof  of  Lemma  1:  We  consider  two  separate  cases 
depending  on  whether  C  is  finite. 

.  1) C  -  +». 

Fix  A  >0  and  pick  r  0  such  that  X,  4=  0.  There  exists  an 
integer  M  such  that  for  all  n  2:  M,  Cn  >  A .  Fix  such  an  M.  Then 
we  have  for  i  2  M 

a>  i  I  I 

2  K  2  B,U)C„+J>\r  2  B,U)Cj+r>\A  2  BAJ) 

n-C  y-0  7-0  .  7 -AT 

which  terminates  the  proof,  since  for  any  fixed  M  >  0 


lint  2  BiU)  =1-  03) 

2)  C  <  +oo. 

We  have  for  /  >  M 

2  X„  2  B'(j)Cn+J-C  s2  BAS)  2  k\c«+j-c\ 

nmO  y*0  Jm  o  nmQ 

+  2  B,U)  2  Xfl|Cn+y-C|.  (14) 

I  n*0 

Fix  e  >  0.  There  exists  M  such  that  for  all  n  >  M,  \C„  -  C|  < 
e/2.  Fix  such  an  M.  Then 

2  BAJ)  2  X„|Cfl+y-C|<|. 

j  "M+ !  /i*0 


Also,  if  L  is  an  upper  bound  for  C„ 

2  BAS)  2  Xfl|CB+;-C|^2L  2 

7-0  n-0  7-0  L 

for  /  big  enough  because  (13)  holds,  which  takes  care  of  the  first 
term  in  (14)  and  ends  the  proof  of  Lemma  1. 

Putting  together  (12)  and  Lemmas  A  and  1,  we  get  that  1)  if 
limn^»  C„  =  +«,  then  lim/-*  d/  =  -<»,  and  ( X„)nz0  is 
ergodicfand  2)  if  lim„^„  C„  =  C  <  +  »,  then  lim(-.M  rf,  =  X  - 
C,  and  ( X„)„i0  is  ergodic  for  X  <  C.  If  X  >  C,  we  can  apply 
Lemma  B  and  conclude  that  (X„)nz0  is  not  ergodic  provided  that 
Kaplan’s  condition  ii)  holds.  This  is  the  purpose  of  Lemma  2, 
which  is  the  last  step  in  the  proof  of  Theorem  1 . 

Lemma  2:  If  for  all  n  2  1,  C„  <  L  for  some  L  €  (0,  oo),  then 
Kaplan’s  condition  holds:  there  exists  a  constant  B,  an  integer  N, 
and  a  real  c  €  (0,  lj  such  that 

*‘-2  pUei* -B(  1-0)  all  / sjV,  9  £  [c,  1], 

J 

Proof  of  Lemma  2:  According  to  [25],  it  is  enough  to  show 
that  the  downward  part  of  the  drift,  defined  as 

/ 

D(i)=-  2  kPu-e 

km  | 

is  bounded  below.  From  the  transition  probabilities  (8),  we  get 
£('>-2  k  E  X"  £  *>())*„+,. n*k 

*■*1  /l-O  jmk 

which  can  also  be  put  in  the  form 

D(i)=-  2  BAJ)  2  X„  2 

7“I  rt-0  km  | 
from  which  it  follows  that 


DV)Z  -2  BiU)  2  KC„+jZ-L. 

Jm  1  rjmO 

o 

Notice  that  in  the  proof  of  Theorem  1  (and  this  also  holds  for 
Theorem  2  below),  the  exact  expression  (7)  for  B,(j)  is  never 
used.  The  only  requirements  are  that  (B,(j)) o5/S/  is  a  probability 
distribution,  and  that  (13)  holds.  Therefore,  our  results  are  valid 
for  a  larger  class  of  retransmission  policies  than  was  first 
assumed.  For  example,  there  could  be  K  priority  groups,  each 
with  a  different  retransmission  probability. 

Although  Theorem  1  is  quite  general,  in  many  practical  cases, 
the  reception  rhatrix  has  a  very  simple  structure  and  the  stability 
region  can  be  obtained  with  virtually  no  computations.  This 
happens  for  instance  in  radio  networks  with  capture  where  all  is 
needed  is  the  limit  of  the  second  column  of  the  matrix,  or  also  in 
the  simple  case  where  above  a  certain  collision  size  t V,  the 
transmission  is  too  garbled  for  the  receiver  to  be  able  to  decode 
anything  correctly,  so  that  C„  =  0  for  n  >  N. 

This  last  example  is  a  particular  case  of  a  noteworthy  feature  of 
Theorem  1 ,  namely  that  the  stability  region  does  not  depend  on 
any  finite  number  of  rows  of  the  reception  matrix.  In  fact,  any 
number  of  modifications  of  the  matrix  that  leaves  lim„-«  C„ 
unchanged  does  not  affect  the  stability  region.  Although  this  may 
be  surprising  at  first  sight,  it  can  be  intuitively  explained  by  the 
fundamental  instability  of  the  collision  channel:  unless  the 
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receiver  is  perfect  (all  enn  equal  to  1),  the  backlog  will  eventually 
exceed  any  prefixed  value  with  probability  one,  thus  it  is  the  limit 
of  C„  that  determines  the  stability  region. 

The  stability  region  is  also  unchanged  if  the  first  transmission  of 
packets  is  delayed.  If  new  packets  are  backlogged,  that  is, 
transmitted  for  the  first  time  with  probability  p  in  each  slot 
following  their  arrival  (this  transmission  rule  appears  in  the 
literature  as  controlled-access  or  delayed  first  transmission),  the 
drifts  become  dt  =  X  -  Sj«t  B,(j)Cj  for  /  >  1,  and  from 
Lemmas  1  and  2  the  ergodicity  region  remains  the  same. 

If  Cn  does  not  have  a  limit,  Theorem  1  does  not  give  the  stable 
throughput  of  the  system.  Even  though  in  almost  all  practical 
cases,  and  indeed  in  all  the  examples  of  Section  II,  C„  does  have  a 
limit,  it  is  conceptually  interesting  to  examine  the  case  when  lim 
inffl-.„  C„  &  lim  sup„-.„  C„.  It  is  worth  pointing  out  that  adding 
constraints  as  strong  as  the  following  on  the  reception  matrix  still 
does  not  imply  that  C„  has  a  limit: 

i)  («/io)/>ai  is  nondecreasing 

ii)  (tnk)nzk  is  nonincreasing  for  all  1 

iii)  for  ns2,  1  zksn-  1 

although  the  counterexamples  we  have  been  able  to  build  are 
somewhat  contrived.  Notice  that  conditions  i)  and  ii)  above  imply 
that  each  column  has  a  limit  ak  =  limn-M  enk(k  >  0),  which  is 
very  likely  to  happen  in  practice.  In  any  case,  Theorem  2  below 
still  gives  some  information  on  the  stability  region,  although  the 
exact  result  requires  in  general  the  complete  knowledge  of  the 
sequence  (Cn)ns.|.  In  fact,  given  any  nonnegative  numbers  a  <  y 
<  (3.  one  can  construct  a  reception  matrix  with  nth  row  average 
C„  such  that: 

i)  lim  inf  Cn~  a 

n—oo 

ii)  lim  sup  C„-fi 

n-*a» 

and  such  that  the  maximum  stable  throughput  is  7. 

Theorem  2:  The  system  is  stable  for  X  <  lim  infn-.<»  C„  and 
unstable  for  X  >  lim  sup„_<,  C„. 

Proof: 

a)  If  X  <  lim  inf„_„  C„,  then  (X„)„i.o  is  ergodic. 

If  lim  inf*-*  C„  =  +00,  then  lim,,-.,,  C„  =  +<»,  and  the 
result  has  already  been  proved,  so  assume  that  lim  inf„-.w  Cn  is 
finite  From  Lemma  A,  it  is  enough  lo  prove  that  for  all  e  >  0, 
there  exists  N  such  that 

di<\-  lim  inf  C„  +  e  all  ZaM 

n— 00 

Recall  from  (12)  that  we  have 

4«x-s*»Sw>c-+/-  (15) 

n«0  jmO 

So  it  is  o  ily  needed  to  prove  that  for  all  e  >  0  there  exists  N  such 
that 

co  i 

2  K  2  B,U)Cn+J>  lim  inf  Cn-e  all  i&M 
/i-o  ;-o 

Now  by  definition  there  exists  M  such  that  for  all  k  >  M: 

Ck>  lim  inf  C„~t 


and  therefore  for  all  i  >  M: 

2  X-  2  B<U)C„+,>( lim  inf  C„—e)  £  Bt(j) 

»■ 0  /■  0  "  °°  JmM 

which  completes  the  proof  since  (13)  holds, 
b)  If  X  >  lim  sup„-.M  C„,  then  (A'„)nio  is  not  ergodic. 

Since  X  is  finite,  in  this  case  lim  sup„«„  C„  is  necessarily  finite. 
Therefore,  (C„)nzi  is  bounded  and  from  Lemma  2,  Kaplan’s 
condition  holds.  Thus,  it  is  enough  to  show  that  for  all  e  >  0, 
there  exists  N  such  that 

di>\  -  lim  sup  C„-6  all  /aiV. 

/i-*co 

From  (15),  we  only  need  to  show 

00  / 

2^2  BiU)Cn+j<  lim  sup  C„  +  6  all  i>N. 

n-0  JmO  n~C° 

Since  there  exists  M  such  that  for  all  k  >  M 
Ck<  lim  sup  C„  +  s 

n—ao 

then  if  L  is  an  upper  bound  for  C„,  we  have  for  /  >  M 

co  I  AT- 1 

2  2  BiU)Cn+j<L  2  BiU)+  lim  sup  C„  +  e 

rt » 0  JmO  ]•  0 

from  which  the  result  follows,  using  (13).  □ 

IV.  Behavior  of  the  Backlog  Markov  Chain  in  the 
Nonergodicity  Region 

In  this  section,  we  further  investigate  the  properties  of  (Xn)„z;o 
in  the  case  X  >  C,  assuming  of  course  that  (C„)ni  t  has  a  finite 
limit.  It  has  been  proved  in  [1]  that  the  backlog  Markov  chain  for 
the  usual  slotted  Aloha  algorithm  is  transient,  but  this  result 
cannot  be  generalized  to  our  model  when  X  >  C.  We  give  below 
an  example  showing  that  (Ar„)„ao  can  be  null  recurrent  when  the 
mean  arrival  rate  X  belongs  to  an  interval  of  positive  length.  The 
boundary  between  the  null  recurrence  and  the  transience  regions 
generally  depends  in  a  rather  complicated  manner  on  both  the 
reception  matrix  and  the  retransmission  probability  p.  We  give  a 
sufficient  condition  for  ( Xn)nz0  to  be  transient  when  X  >  C,  as 
well  as  bounds  on  the  recurrence  region. 

Consider  the  reception  matrix  defined  by 

i»k=~i  (1  £k<,n) 
n 


for  n  5  I.  Then  C„  =  k/n1  =  (n  +  l)/2 n,  and  C  =  1/2. 
Using  Lemmas  C  and  D  below,  we  show  in  [26]  that  X„  is 
recurrent  for  X  <  R(p)  and  transient  for  X  >  R(p),  where  R(p) 
is  a  function  of  the  retransmission  probability  p  and  is  given  by 

R(P) =-+V-p- In  (l-p)  (0<p<l) 

p  pi 

R{  D=l. 

It  is  easily  seen  that  R(p)  is  an  increasing  function  of  p  for  p  E 
]0,  1[  with  extrema  linv-o  R(.P)  =  1/2  and  lim^-i  R(p)  =  1. 
Fig.  3  summarizes  the  behavior  of  Xn  for  this  example. 

It  is  somehow  surprising  to  see  that  in  this  case,  as  well  as  in  all 
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Fig.  3.  Transience  and  ergodicity  regions  as  a  function  of  the  retransmission 

probability  when  <„t  =  l//i\ 

die  other  examples  we  have  computed,  the  recurrence  region 
becomes  larger  as  p  increases.  Intuitively,  the  recurrence  of  Xn 
when  X  >  C  seems  to  be  due  to  the  fact  that  transitions  from  any 
state  i  to  0  (or  to  some  fixed  integer  kQ)  are  possible  and  that  the 
probability  of  such  an  event,  Pi0  (or  P,*0),  goes  to  zero  slowly 
with  /.  It  can  be  checked  that  these  probabilities  are  increasing 
functions  of  p  when  I  is  large  enough. 

Transience  is  ensured  for  X  >  C  if  the  supremum  of  the 
elements  of  the  foh-column  goes  to  zero  faster  than  k2.  This 
condition  holds  for  all  the  examples  in  Section  II,  as  well  as  for 
many  real  life  cases,  due  to  the  practical  limitations  on  the 
receiver  capabilities.  In  particular,  it  is  always  verified  if  the 
reception  matrix  has  only  a  finite  number  of  nonzero  columns  (or 
equivalently,  if  die  backlog  Markov  chain  has  uniformly  bounded 
downwards  transitions,  as  defined  in  [3])  which  happens,  for 
instance,  if  there  is  capture.  Note  that  the  proof  of  Theorem  3 
below  is  of  course  valid  for  the  conventional  collision  channel, 
and  in  this  case  becomes  somewhat  simpler  than  the  proof  in  [I], 

Theorem  3:  If  lim*-®  k 2  sup***  enk  =  0,  then  (X„)ni0  is 
transient  for  X  >  C. 

Because  of  the  complexity  and  lack  of  structure  of  the  one-step 
transition  probabilities  (8),  few  results  on  the  recurrence  and 
transience  of  Markov  chains  can  be  applied  to  our  model.  Before 
proving  Theorem  3,  let  us  introduce  the  following  two  criteria 
from  [27], 

Lemma  C:  Let  (X„)ni0  be  an  irreducible  and  aperiodic 
Markov  chain,  having  as  state  space  the  set  of  nonnegative 
integers,  and  with  one-step  transition  probability  matrix  P  = 
(P,j).  (X„)„i:o  is  recurrent  if  and  only  if  there  exists  a  sequence 
(yn)nzo  such  that 

1)  lim  yn=  +qo 

n~  co 

CO 

2)  for  some  integer  N> 0  ^  yjPij^yi  all  iszN. 

/- o 

We  will  only  use  the  sufficiency  part,  which  has  also  been  proved 
in  (24]. 

Lemma  D:  With  the  same  assumptions  as  in  Lemma  C, 
(X„)m:0  is  transient  if  and  only  if  there  exists  a  sequence  (y„)m o 
such  that 

1)  (Tn)nao  is  bounded 

2)  for  some  integer  Af>0  yjP,j^yi  all  />jV 

3)  for  some  k>N yk<y<u  **-,7/v- ■- 


Sufficiency  under  the  additional  constraints  yt>  0  and  lim,_«  y, 
=  0  has  also  been  proved  in  [28].  Also,  the  sufficiency  parts  of 
both  lemmas  are  an  immediate  consequence  of  [29,  Theorems  5 
and  6]  together  with  the  results  in  [30], 

Proof  of  Theorem  3:  We  use  Lemma  D  with  y„  =  l/(n  + 
l)9,  6  €  )0,  1[.  We  have 
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The  drift  of  Xn  at  state  /  can  be  computed  from  the  transition 
probabilities  (8) 
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The  idea  of  the  proof  is  to  show  that 

\m[D'(i)  +  U'(i)]=-9Ymd,  (21) 

/— 00  /—CD 

and  since  it  has  been  proved  in  Section  III  that  lim(_„  dt  =  X  - 
C,  we  will  be  able  to  conclude  that  (X„)„zo  is  transient  for  X  > 
C. 

1)  lim  [£>'(0  +  00(0)  =  0. 
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From  (18)  and  (20) 
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If  lun*-.«  fc27*  =  0,  then  limn-„  1/«S”  ,  k2yk  =  0.  So  we  can 
choose  /  large  enough  so  that  for  n  >  (/  +  l)/2.  /r27*  < 

«e.  Then 
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n  «0 


/+ 1  2 


Now  if  we  choose  /  big  enough  so  that  for  A:  >  (/  +  3)/2,  we  have 
7*  <  e/&2,  then 
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By  bounding  the  sum  in  the  last  equation  by  integrals,  it  can  be 
seen  that  it  is  upper  bounded  by  a  linear  function  of  /. 

2)  lim,_«  [U'(i)  +  0U(i)]  =  0. 

From  (18)  and  (20) 
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with,  assuming  for  instance  that  i  is  odd 
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By  using  the  following  inequalities: 
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We  show  that  x,(/)  and  x2(/)  go  to  zero  independently.  Fix  e  >  0. 
Define  for  0  <  X  S  /  the  function 
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It  is  easily  proved  that  for  each  /  >  1,  p,(x)  is  a  positive 
nondecreasing  function  of  x.  Also 
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where  /4  is  a  positive  constant  depending  only  on  0.  From  (23) 

o>  (1  +  l)/2  4  »  n  +  (l+ 1)/2 

jcj {/) = 2  k  2  klP‘(k)7n+k£—{  2  x»  2 

n.O  k*  1  1  n«0  *-1 


/I*  I 


n*N+ 1 


Fix  «  >  0.  Choose  iV  such  that  l^+,n\n  <  e/2,  and  then,  N 

being  fixed,  choose /large  enough  so  that  l/(i  +  l)S£.,rt  < 
e/2.  0 

It  should  be  clear  at  this  point  that  unlike  the  ergodicity  region, 
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the  recurrence  region  depends  in  general  on  the  elements  of  the 
reception  matrix  (instead  of  only  the  row  averages)  and  on  the 
retransmission  probability  p.  For  this  reason,  an  exact  expression 
for  the  recurrence  region  seems  rather  difficult  to  obtain; 
nonetheless,  the  method  (see  [26])  that  we  used  to  study  the 
example  in  Fig.  3  can  be  generalized  to  obtain  the  following  upper 
and  lower  bounds  on  the  recurrence  region. 

Theorem  4:  (A^ )„;.(,  is  recurrent  for  X  <  L  and  transient  for  X 
>  U,  with  L  =  max  {lu  sup0<9<i  /$,  sup0«i<i  /»' }  and  U  -  min 
(«i,  infold  ue,  inf0«,<i  ««'}  where 


“  '  n+-'  /  i+n+l  \ 

/,  =  lim  (i+  1)  S  Xn  S  B,U)  2  In  ii+n_-k—  )  «»+** 

n-0  J-  1  *-l  '  ' 


h~\  lim  (/+1)1-5  2  >n'2BiU) 

6HCC  n-0  /-I 


00  1 


2  [(/+«+  l)9-(/+n-*+l)9]f,,+M 

^•i 


i  ”  ' 

'*  =7  lim  (/+  1)  [In  (/+  l)]1-*  2  X»  S  B‘U) 


n-0  J-\ 


n+j 


2  [fin  (/  +  «+ 1)]9 -[In  (/+«-*  +  l)]^]«a+M 

Ar-l 


and 


m  =  Hm  (/+  1)  (In  (/+  l)]2  2  X"  S 

1  *  n-0  J-l 

/f  r _ i _ ! _ 

"  In  (/  +  /!  +  2-&)  ln(/+n  +  2) 


^  =  i  lim  (;+l)l+*2Xn2  BiU) 

V  i~ 00  A  , 


.  y  I _ i _ L__ 

LU+n  +  l-ky  {i+n+l y 
u;=Ui+l)  [In  (/+  1)],+9  2 

0  n-0  ;- i 

.  vy  r _ ! _ * _ 

"  [[In  (/+/» +2- A:)]9  (ln(/+n  +  2)]9 

We  are  assuming  that  the  limits  above  exist,  which  indeed 
happens  in  most  practical  cases.  The  theorem  is  valid  if  any  of 
these  limits  is  infinite.  In  particular,  if  L  =  +»,  then  X„  is 
always  recurrent.  Note  that  usually ,  it  is  not  necessary  to  carry  out 
all  the  computations,  because  one  of  the  three  terms  in  the 
definition  of  L  is  equal  to  one  of  the  terms  in  the  definition  of  U. 
In  fact,  in  most  cases,  we  have  sup0«j<i  /«  =  inf0<o<i  if  0  <  p 
<  1 ,  and  Mi  =  /,  if  p  =  1 .  The  proof  of  Theorem  4  can  be  found 
in  [26]. 


en*J,le 


References 


(1]  W.  A.  Rosenkrantz  and  D.  Towsley,  “On  the  instability  of  die  slotted 
Aloha  multiaccess  algorithm,  *  IEEE  Trans.  Automat.  Contr.,  vol. 
AC-28,  pp.  994-996,  1983. 

(2]  G.  Fayolle,  E.  Gelenbe,  and  !.  Labetouile,  “Stability  and  optimal 
control  of  the  packet  switching  broadcast  channel,”  J.  Ass.  Comput. 
Mach.,  vol.  24,  no.  3,'pp.  375-386,  1977. 

(3]  M.  Kaplan,  “A  sufficient  condition  for  nonergodicity  of  a  Markov 
chain,”  IEEE  Trans.  Inform.  Theory,  vol.  IT-25,  pp.  470-471, 
1979. 

[41  D.  Bertsekas  and  R.  Gallager,  Data  Networks.  Englewood  Cliffs, 
NJ:  Prentice-Hall,  1987. 

[5]  B.  Hajek  and  T,  Van  Loon,  "Decentralized  dynamic  control  of  a 
multiaccess  broadcast  channel,”  IEEE  Trans.  Automat.  Contr.,  vol. 
AC-27,  pp.  559-569,  1982. 

[6]  B.  Hajek,  “Hiding-time  and  occupation-time  bounds  implied  by  drift 
analysis  with  applications,"  Adv.  Appl.  Prob.,  vol.  14,  pp.  502-525, 

1982. 

[7]  J.  N.  Tsitsiklis,  “Analysis  of  a  multiaccess  control  scheme,”  IEEE 
Trans.  Automat.  Contr.,  vol.  AC-32,  pp.  1017-1020,  1987. 

[8]  S.  Verdii,  “Minimum  probability  of  error  for  asynchronous  Gaussian 
multiple-access  channels,”  IEEE  Trans.  Inform.  Theory,  vol.  IT-32, 
pp.  85-96,  1986. 

[9]  D.  Raychaudhuri,  “Performance  analysis  of  random  access  packet- 
switched  code  division  multiple  access  systems,”  IEEE  Trans. 
Commun.,  vol.  COM-29,  pp.  895-901,  1981. 

[10]  A.  Polydoros  and  J.  Silvester,  “Slotted  random  access  spread-spectrum 
networks:  An  analytical  framework,"  IEEE  J.  Select.  Areas  Com¬ 
mon.,  vol.  SAC-5,  no.  6,  pp.  989-1002,  1987. 

[1 1]  M.  K.  Simon,  J.  K.  Gmura,  R.  A.  Scholtz,  and  B.  K.  Levitt,  Spread- 
Spectrum  Communications.  New  York:  Computer  Science  Press, 
1985. 

[12]  N.  Abramson,  “The  throughput  of  packet  broadcasting  channels," 
IEEE  Trans.  Commun.,  vol.  COM-25,  pp.  117-128,  1977. 

[13]  D.  J.  Goodman  and  A.  A.  M.  Saleh,  “The  near/far  effect  in  local 
Aloha  radio  communications,"  IEEE  Trans.  Vehicular  Technol., 
vol.  VT-36,  Feb.  1987. 

[14]  L.  G.  Roberts,  “Aloha  packet  system  with  and  without  slots  and 
capture,”  Comput.  Commun.  Rev.,  no.  5,  pp.  28-42,  1975. 

[15]  C.  Namislo,  “Analysis  of  mobile  radio  slotted  Aloha  networks,”  IEEE 
J.  Select.  Areas  Commun.,  vol.  SAC-2,  pp.  583-588,  1984. 

[16]  D.  H.  Davis  and  S.  A.  Gronemeyer,  “Performance  of  slotted  Aloha 
random  access  with  delay  capture  and  randomized  time  of  arrival," 
IEEE  Trans.  Commun.,  vol.  COM-28,  pp.  703-710,  1980. 

[17]  J.  C.  Ambak  and  W.  van  Blitterswijk,  "Capacity  of  slotted  Aloha  in 
Rayleigh  fading  channels,”  IEEE  J.  Select.  Areas  Commun.,  vol. 
SAC-5,  no.  2,  pp.  261-269,  1987. 

[18]  C.  C.  Lee,  “Random  signal  levels  for  channel  access  in  packet  radio 
networks,”  IEEE  J.  Select.  Areas  Commun.,  vol.  SAC-5,  no.  6,  pp. 
1026-1034,  1987. 

[19]  B.  S.  Tsybakov,  V.  A.  Mikhailov,  and  N.  B.  Likhanov,  "Bounds  for 
packet  transmission  rate  in  a  random  multiple  access  system,” 
Problemy  Peredachi  Informatsii,  vol.  19,  no.  1,  pp.  61-81.  1983. 

[20]  N.  Mehravari,  "Collision  resolution  in  random  access  systems  with 
multiple  reception,”  preprint,  1987. 

[21]  A.  Shwartz  and  M.  Sidi,  “Erasure,  capture  and  noise  errors  in 
controlled  multiple  access  networks,"  in  Proc.  25th  Conf.  Decision 
Contr.,  Dec.  1986,  pp.  1333-1334. 

[22]  R.  E.  Kahn,  S.  A.  Gronemeyer,  J.  Burchfiel,  and  R.  C.  Kunzelman, 
"Advances  in  packet  radio  technology,"  Proc.  IEEE,  vol.  66,  pp. 
1468-1496,  1978. 

[23]  W.  Szpankowski,  "Analysis  and  stability  considerations  in  a  reserva¬ 
tion  multiaccess  system,”  IEEE  Trans.  Commun.,  vol.  COM-31,  no. 
5,  pp.  684-692,  1983. 

[24]  A.  G.  Pakes,  “Some  conditions  for  ergodicity  and  recurrence  of 
Markov  chains,"  Oper.  Res.,  vol  17,  pp.  1058-1061,  1969. 

[25]  L.  I.  Sennott,  P.  A.  Humblet,  and  R.  L.  Twcedie,  “Mean  drifts  and  the 
non-ergodicity  of  Markov  chains,”  Oper.  Res.,  vol.  31,  pp.  783-789, 

1983. 

[26]  S.  Ghez,  S.  Verdu,  and  S.  C.  Schwartz,  "Random  access  communica¬ 
tion  with  multipacket  reception  capability.  I:  Stability  of  Aloha,”  Dep. 
Elect.  Eng.,  Princeton  Univ.,  Tech.  Rep.  ISS.  8748,  1987. 

[27]  J.  F.  Mertens,  E.  Samuel-Cahn,  and  S.  Zamir,  “Necessary  and 
sufficient  conditions  for  recurrence  and  transience  of  Markov  chains,  ,n 
terms  of  inequalities,"  J.  Appl.  Prob.,  vol.  15,  pp.  848-851,  1978. 

[28]  C.  M.  Harris  and  P.  G.  Marlin,  “A  note  on  feedback  queues  with  bulk 
service,”  J.  Assoc.  Mach.,  vol.  19,  pp.  727-733,  1972. 

[29]  F.  G.  Foster,  “On  the  stochastic  matrices  associated  with  certain 
queuing  processes,"  Ann.  Math.  Statist.,  no.  24,  pp.  355-360, 1953. 

[30]  P.  Vit,  "On  the  equivalence  of  certain  Markov  chains,”  J.  Appl. 
Prob.,  vol.  13,  pp.  357-360,  1976. 

[31]  S.  Ghez,  S.  Verdu,  and  S.  C.  Schwartz,  "On  decentralized  control 
algorithms  for  muitipacket  Aloha,”  in  Proc.  25th  Allerton  Conf.  on 
Commun.,  Contr.,  and  Comput.,  Oct.  1987. 


GHEZ  t t  a/.:  STABILITY  PROPERTIES  OF  SLOTTED  ALOHA 


649 


Sylvie  Ghee  (S’88)  was  bom  in  Paris,  France,  on 
June  3,  1961.  She  received  the  engineering  dip¬ 
loma  from  the  Ecolc  Nationale  Superieure  des 
Telecommunications,  Paris,  in  1984,  and  the  M.S. 
degree  in  electrical  engineering  from  Princeton 
University,  Princeton,  NJ,  in  1987,  where  she  is 
currently  a  doctoral  candidate. 

She  is  the  recipient  of  the  Wallace  Memorial 
Fellowship  for  the  year  1987-1988.  Her  research 
interests  include  multiple  access  protocols,  network 
routing  problems,  queueing  theory,  and  applied 
probability  theory. 


Sergio  Verdti  (S’80-M'84)  received  the  Ph.D. 
degree  in  electrical  engineering  from  the  University 
of  Illinois  at  Urbana-Champaign  in  1984. 

Upon  completion  of  his  doctorate  he  joined  the 
faculty  of  Princeton  University,  Princeton,  NJ, 
where  he  is  an  Assistant  Professor  of  Electrical 
Engineering  His  recent  research  contributions  are 
in  the  areas  of  multiuser  communication  and  in¬ 
formation  theory,  and  statistical  signal  process¬ 
ing. 

Dr.  Verdti  is  a  recipient  of  the  National 


University  Prize  of  Spain,  the  Rhcinstein  Outstanding  Junior  Faculty  Award 
of  the  School  of  Engineering  and  Applied  Science,  Princeton  University,  and 
the  NSF  Presidential  Young  Investigator  Award.  He  is  currently  serving  as 
Associate  Editor  of  the  IEEE  TRANSACTIONS  ON  AUTOMATIC  CONTROL. 


Stuart  C.  Schwartz  (S’64-M‘66-SM’83)  was  bom 
in  Brooklyn,  NY,  on  July  12,  1939.  He  received  the 
B.S.  and  M.S.  degrees  in  aeronautical  engineering 
from  the  Massachusetts  Institute  of  Technology, 
Cambridge,  in  1961,  and  the  Ph.D.  degree  from  the 
Information  and  Control  Engineering  Program, 
University  of  Michigan,  Ann  Arbor,  in  1966. 

While  at  M.I.T.  he  was  associated  with  the  Naval 
Supersonic  Laboratory  and  the  Instrumentation 
Laboratory.  From  1961  to  1962  he  was  at  the  Jet 
Propulsion  Laboratory,  Pasadena,  CA,  working  on 
problems  in  orbit  estimation  and  telemetry.  He  is  presently  a  Professor  of 
Electrical  Engineering  and  Chairman  of  the  department  at  Princeton 
University,  Princeton,  NJ.  He  served  as  Associate  Dean  of  the  School  of 
Engineering  from  July  1977  to  June  1980.  During  the  academic  year 
1972-1973,  he  was  a  John  S.  Guggenheim  Fellow  and  a  Visiting  Associate 
Professor  at  the  Department  of  Electrical  Engineering,  Techmon— Israel 
Institute  of  Technology ,  Haifa,  Israel.  During  the  academic  year  1980-1981, 
he  was  a  member  of  the  Technical  Staff  at  the  Radio  Research  Laboratory, 
Bell  Telephone  Laboratories,  Crawford  Hill,  NJ.  His  principal  research 
interests  are  in  the  application  of  probability  and  stochastic  processes  to 
problems  in  statistical  communication  and  system  theory. 

Dr.  Schwartz  ts  a  member  of  Sigma  Gamma  Tau,  Eta  Kappa  Nu,  and  Sigma 
Xi.  He  has  served  as  an  Editor  for  the  SIAM  Journal  on  Applied 
Mathematics  and  as  Program  Chairman  for  the  1986  ISIT. 


T-AC/34/9//29606 


Control  and  Optimization  Methods  in 
Communication  Network  Problems 


Anthony  Ephremides 
Sergio  Verdu 


Reprinted  from 

IEEE  TRANSACTIONS  ON  AUTOMATIC  CONTROL 
Vol.  34,  No.  9,  September  1989 


93l 


IEEE  TRANSACTIONS  ON  AUTOMATIC  CONTROL,  VOL.  34.  NO.  9,  SEPTEMBER  1989 


Control  and  Optimization  Methods  in 
Communication  Network  Problems 

ANTHONY  EPHREMIDES,  fellow,  ieee,  and  SERGIO  VERDU,  senior  member,  ieee 


Abstract — In  this  paper  we  focus  on  two  areas  of  communication 
network  design  in  which  methods  of  control  and  optimization  theory  have 
proven  useful,  .nese  are  the  area  of  multiple  access  communication  (for 
networks  with  shared  links  such  as  radio  networks  and  local  area 
networks)  and  the  area  of  network  routing  (for  networks  with  point-to- 
point  interconnections).  We  review  a  few  selected  problems  in  each  area 
to  show  the  role  of  the  control  concepts  involved  and  we  then  proceed  to 
identify  other  areas  of  communication  network  design  in  which  the  same 
control  theoretic  and  optimization  methodology  may  be  applicable  and 
useful.  We  do  not  survey  the  work  done  in  this  area,  nor  do  we  review 
work  in  control  areas  whose  methods  are  applicable  in  other  communica¬ 
tion  network  problems.  Instead,  we  attempt  to  bring  to  the  attention  of 
the  control  systems  community  the  numerous  instances  of  problems 
arising  in  the  pure  communication  network  design  process  that  can 
benefit  from  (he  attention  and  the  capabilities  of  this  community. 


I.  Introduction 

COMMUNICATION  networks  are  designed  and  built  in  order 
to  share  resources.  If  interconnecting  systems  and  bandwidths 
were  available  at  no  cost,  then  the  solution  to  the  problem  of 
communication  would  be  to  assign  dedicated  communication  links 
(channels)  of  sufficient  capacity  to  every  pair  of  conceivable  users 
to  meet  their  needs.  This  not  being  the  case,  it  is  necessary  to 
multiplex  the  sources  of  communication  traffic  in  order  to 
optimize  various  cost  criteria.  Frequently,  this  optimization  is 
dynamic  and  done  on  the  basis  of  feedback  that  monitors  the 
evolution  of  the  degree  of  utilization  of  the  network  resources. 
Thus,  vye  should  expect  a  number  of  problems  arising  in 
communication  network  design  to  fit  naturally  in  the  framework 
of  control  systems  design.  In  this  paper  we  wish  to  demonstrate 
that  indeed  this  is  the  case  and  to  show  how  various  control  and 
optimization  methodologies  have  been  used  in  the  study  of 
communication  networks. 

In  the  beginning  there  was  a  single  communication  network,  the 
telephone  network.  It  represented  a  multibillion  dollar  investment 
and  seemed  to  serve  reasonably  adequately  the  voice  communica¬ 
tion  needs.  The  explosive  growth  in  data  communication  needs 
during  the  last  30  years  built  up  the  pressure  for  additional  and 
alternative  networking  options.  As  a  result,  the  notion  of  store- 
and-forward  switching  (known  also  as  message  switching)  was 
introduced  in  the  early  1960’s.  This  notion  represented  a 
breakthrough  since  it  constituted  a  radical  reversal  of  thinking 
with  respect  to  the  circuit-switching  process;  namely,  instead  of 
securing  an  open,  dedicated  “pipe”  for  the  transmission  of 
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messages  by  means  of  hardware  switches,  it  allowed  a  step-by- 
step  (node-by-node)  forwarding  of  messages,  thereby  permitting 
each  node  to  switch  messages  by  deciding  when  and  where  to 
transmit  the  messages  in  its  buffer.  In  the  last  20  years  we  have 
seen  an  avalanche  of  technologies  (fast  switching,  time  division 
multiplexing,  local  area  networks,  fiber  optical  networks,  inte¬ 
grated  services  digital  networks,  etc.)  and  a  proliferation  of 
operational  public  and  private  networks  that  put  these  technolo¬ 
gies  to  test  and  challenged  communication  engineers.  In  addition, 
they  should  challenge  control  engineers  as  well. 

Without  attempting  a  survey  of  this  vast  application  area  we 
wish  to  promulgate  the  viewpoint  that  many  (if  not  most)  specific 
sub-problems  in  the  network  design  process  are  natural  control 
problems.  In  support  of  this  thesis,  we  choose,  first,  to  demon¬ 
strate  how  two  major  areas  in  communication  networks  (routing 
and  multiple  access)  have  benefitted  from  the  use  of  techniques 
borrowed  from  what  is  traditionally  perceived  as  control  systems 
methodology  and,  second,  to  mention  additional  areas  that  are 
likely  to  benefit  from  the  control  systems  community.  As 
illustrated  in  this  paper,  the  techniques  that  have  proved  useful  in 
communication  networks  include:  dynamic  programming  (e.g., 
m,  [6],  [8]— [10],  [22],  [29],  [38],  [39],  [47],  [49],  [54]);  linear 
programming  (e.g.,  [50],  [51]);  constrained  and  iterative  optimi¬ 
zation  (e.g.,  [5],  [14],  [16],  [42]);  Markov  decision  theory  tools 
(e.g.,  [2],  [26],  [29],  [38]);  control  of  Markov  chains  (e.g.,  [11], 
[17],  [18],  [20],  [40],  [45]);  stability  analysis  of  stochastic 
systems  via  Lyapunov  methods  (e.g.,  [31],  [43]);  sample  path 
dominance  (e.g.,  [2],  [52]);  and  convergence  of  distributed  and 
asynchronous  algorithms  (e.g.,  [6],  [16],  [42]). 

The  problem  of  routing  is  encountered  in  all  and  every  network 
that  does  not  permit  the  source  to  reach  the  destination  in  a  single 
transmission  hop,  but  instead  it  must  traverse  a  path  of  intermedi¬ 
ate  links.  By  contrast,  the  problem  of  multiple  access  is 
encountered  primarily  in  those  networks  that  permit  the  nodes  to 
reach  their  destination  directly  in  one  hop  by  having  to  share  the 
same  link  with  other  transmitting  nodes.  In  addition,  the  two 
problems  are  fundamentally  different  in  nature  and,  jointly,  cover 
considerable  ground  in  the  networking  area.  Finally,  together  they 
facilitate  the  identification  of  additional  design  issues  and  the 
extension  of  the  applicability  of  suitable  control  methods.  Thus, 
they  represent  “cornerstone”  areas  of  network  design. 

Routing  can  be  studied  either  macroscopically  or  microscopi¬ 
cally.  The  macroscopic  viewpoint  considers  basically  a  flow 
model  and  determines  the  splitting  of  the  flow  in  order  to  reach  the 
destination  in  minimum  time  with  efficient  use  of  the  network 
resources.  It  is  traditionally  referred  to  as  static  routing.  The 
microscopic  viewpoint  dissects  the  flow  process  down  to  the 
atomic  level  of  the  individual  transmission  unit,  the  message  (a 
string  of  bits  commonly  referred  to  as  packet),  and  determines  the 
path  each  message  must  follow  at  each  of  its  hops  through  the 
network.  It  is  traditionally  referred  to  as  dynamic  routing.  Both 
viewpoints  are  explored  in  Section  II. 

Multiple  access  is  a  collective  term  that  refers  to  numerous 
problems  that  deal  with  the  dynamic  allocation  of  a  single 
resource  among  users  who  can  coordinate  their  use  of  that 
resource  only  by  making  use  of  that  resource.  These  problems 
arise  primarily  in  the  context  of  radio  channels  but  also  in  the 
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context  of  shared  cable  resources  in  local  area  networks.  In 
Section  III,  we  explore  the  main  multiple  access  problems  where 
control  methods  have  been  successfully  applied. 

Beth  in  the  case  of  routing  as  well  as  in  the  case  of  multiple 
access  we  place  the  empnasis  on  the  control  techniques  that  have 
been  used.  We  then  show  how  these  techniques,  sometimes  with 
slight  modification,  can  be  naturally  transported  to  other  problem 
areas  such  as  voice-data  integration,  flow  control,  and  the 
scheduling  of  messages  and  links.  This  is  done  in  Section  IV. 

II.  Network  Routing 

The  problem  of  routing  in  communication  networks  is  one  that 
has  received  early  attention  and  has  experienced  significant 
breakthroughs  in  the  brief  history  of  the  field  of  communication 
networks.  It  is  one  of  the  first  problems  that  gained  prominence  as 
a  .csult  of  the  emergence  of  store-and-forward  switching.  It  is  also 
one  in  which  analytical  tools  and  available  theories  applied  nicely 
from  the  beginning. 

A.  Static  Routing 

Given  a  network  (a  set  of  nodes  connected  by  directed  links)  a 
path  connecting  the  source  node  to  the  destination  node  has  to  be 
selected  from  the  set  of  all  possible  such  paths. 1  Ip.  the  simplest 
formulation,  the  problem  is  one  of  Finding  the  shortest  path,  i.e. ,  a 
length  is  assigned  to  each  link  and  the  optimization  criterion  is  the 
total  path  length.  This  problem  is  one  of  the  archetypical 
combinatorial  optimization  problems  (the  solution  can  be  found  by 
exhaustive  enumeration  of  a  finite  set  of  possibilities— all  possible 
paths  from  source  to  destination).  Among  the  many  existing 
shortest  path  algorithms  (see,  e.g.,  [41]),  the  Bellman-Ford 
algorithm  (1956)  is  of  particular  interest  to  our  exposition,  both 
because  it  is  based  on  dynamic  programming  and  because,  as  we 
will  see  below,  it  easily  lends  itself  to  distributed  asynchronous 
implementation.  A  natural  choice  to  find  the  shortest  path  from 
source  to  destination  m  a  layered  network  (i.e.,  one  in  which  the 
nodes  can  be  grouped  in  subsets  Uu  •••  such  that  the  source 
and  destination  nodes  belong  to  U\  and  0M,  respectively,  and 
there  are  links  only  between  nodes  in  adjacent  layers  t/*_i  and 
Uic)  such  as  the  one  in  Fig.  1,  is  the  dynamic  programming 
algorithm,  where  the  shortest  paths  and  distances  (costs-to-go)  of 
the  nodes  in  layer  Uk  to  the  destination  are  computed  based  on  the 
shortest  paths  and  distances  of  the  nodes  in  layer  (/*+,.  If  the 


1  All  the  algorithms  and  results  discussed  in  this  section  can  be  extended  to 
the  case  where  there  are  several  source-destination  pairs  in  the  network. 


Fig.  2.  Arbitrary  network  showing  link  lengths.  Source  is  node  1  and 
destination  is  node  5. 

network  is  not  layered  (such  as  that  in  Fig.  2),  its  shortest  path  can 
be  obtained  by  Finding  the  shortest  path  in  a  layered  network 
derived  from  the  original  one  as  specified  in  the  Bellman-Ford 
algorithm:  the  number  of  layers  is  equal  to  the  number  of  nodes  in 
the  original  network,  say  N,  each  layer  contains  a  copy  of  each  of 
the  N  nodes,  and  there  is  a  link  connecting  two  nodes  in 
consecutive  layers  if  such  a  link  exists  in  the  original  network,  in 
addition,  copies  of  the  same  node  in  consecutive  stages  are 
connected  by  a  zero-length  link.  (Fig.  1  was  actually  derived  from 
Fig.  2  using  this  rule.)  It  is  easy  to  see  by  induction  that  £>*(/),  the 
cost-to-go  of  node  i  in  lay  ,r  N  -  k,  is  the  minimum  length  of  any 
path  from  i  to  the  destination  that  uses  at  most  k  links  (in  the 
original  network).  Since  no  shortest  path  uses  more  than  N  -  1 
links  (link  lengths  are  assumed  nonnegative  and,  therefore,  no 
path  containing  loops  need  be  considered),  the  cost-to-go  of  node  i 
at  layer  1,  £>W-i(0  will  indeed  be  the  length  of  the  shortest  path 
from  node  /  to  the  destination.  Thus,  the  Bellman-Ford  algorithm 
can  be  formulated  as  the  iteration 

D*(0«  min  [Z?*.,(y)  +  dv]  for  A:  =  1 ,  •••  N-l  (2.1) 
jem 

where  dtJ  is  the  length  of  the  link  from  i  to  j,  N(i)  is  the  set  of 
nodes  for  which  such  a  link  exists  and  it  is  assumed  that  A>0 )  = 
oo  if  i  is  not  the  destination  node,  which  corresponds  to  the 
removal  of  all  the  nodes  but  the  destination  in  the  final  layer  (Fig. 
1). 

4  Contrary  to  what  may  appear  at  first  glance  there  is  a  lot  more 
to  network  routing  than  Finding  shortest  paths.  After  all,  the 
shortest  path  may  not  be  the  best  path.  The  reason  is  that  the  real 
goal  is  to  minimize  the  delay  experienced  in  going  from  source  to 
destination,  and  the  delay  encountered  in  each  link  is  usually  a 
function  of  the  amount  of  traffic  carried  by  the  link  (as  the  link 
becomes  congested,  it  takes  longer  to  go  through  it),  which  is 
referred  to  as  the  link  flow  and  is  quantified  in  packets  (or 
messages)  per  second.  Then,  assuming  a  given  desired  flow  level 
from  source  to  destination,  the  problem  is  how  to  distribute  it 
among  all  the  possible  paths  so  as  to  minimize  the  total  delay  In 
contrast  to  the  previous  more  elementary  formulation  of  the 
routing  problem  which  led  to  the  shortest  path  combinatorial 
optimization  problem  and  which  corresponds  to  the  special  case  in 
which  the  link  delays  are  independent  of  the  flows,  we  now  face  a 
continuous  optimization  problem  which  can  be  written  as 

minimize  F(x)  =  2  D,J  (  2  x(n) ) 

(U)  ' nepu.j )  / 

subject  to  x  €  X=  joc(I),  •  •  -x(J))  €  RJ, 

2  x(n)  =  X,  *(n)>ol  (2.2) 

/i-i  ) 
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Fig  3  Characterization  of  the  solution  to  the  minimum-delay  routing 

problem. 

where  the  set  of  all  paths  from  source  to  destination  is  labeled  { 1, 
jc  =  (x(l),  x(Jj)  is  the  vector  of  unknown 
nonnegative  path  flows  which  sum  up  to  X,  the  desired  flow  from 
source  to  destination;  P(i,  j )  C  {1,  •••,  /}  is  the  subset  of  paths 
that  traverse  link  (/,  j);  and  Djj(x)  is  the  portion  of  the  overall 
delay  contributed  by  the  link  from  node  f  to  node  j  when  the  flow 
it  carries  is  equal  to  x.  In  order  to  characterize  a  global  solution  to 
the  optimization  over  a  convex  set  in  (2.2),  it  is  natural  to  restrict 
attention  to  convex  penalty  functions.  In  practice,  it  is  common 
that  the  incremental  delay  in  a  link  grows  with  the  amount  of 
traffic  it  carries  and,  therefore,  it  can  be  assumed  that  the 
functions  D-,-,  are  convex  without  affecting  significantly  the 
practical  applicability  of  the  results. 

Now,  the  characterization  of  the  solution  to  (2.2),  x*,  is 
straightforward.  Since  the  feasible  set  X  and  the  penalty  function 
F  are  convex,  it  is  necessary  and  sufficient  that  the  directional 
derivative  of  the  penalty  function  be  nonnegative  when  evaluated 
at  x*  in  the  direction  of  any  of  the  elements  of  X  (e.g.,  [37]) 

O^Iim-[F((l-a)x*  +  ax)-F(x*)]  Vx  €  X  (2.3) 

aio  a 

which  translates  into 

0sS  D'j  (  £  X  [*(«)-**(«)] 

OJ)  \m6P(ij)  /  nfzP(lj) 

J 

«  ^  [*(«)-**(«)]  </,.(«)  for  all  x  €  *  (2.4) 

n-l 

where  dx(n)  =  SMei(n)£>'(Sm6W./1x+(wi))  is  the  length  of  path 
n  when  the  length  of  each  link  is  equal  to  the  derivative  of  its  delay 
evaluated  at  the  set  of  flows  x,  and  L(n)  is  the  set  of  links  used  by 
path  n.  The  solution  to  (2.4),  x*,  is  the  vector  in  X  that  minimizes 
its  inner  product  with  the  vector  of  distances  dx*.  Thus,  x*  puts 
all  its  weight  on  the  smallest  components)  of  dx+.  The  conclusion 
is  that  the  optimum  flow  uses  only  shortest  paths  computed 
according  to  the  derivative  of  the  link  delays. 

This  solution  to  the  minimum-delay  routing  problem  allows  us 
to  check  whether  a  given  set  of  flows  is  optimum.  Unfortunately, 
it  does  not  tell  us  how  to  find  the  optimum  flows.  Indeed,  we  face 
the  chicken-and-egg  situation  depicted  in  Fig.  3.  The  optimum 
flows  are  obtained  by  solving  a  shortest  path  problem;  but  in  order 
to  compute  the  link  lengths  it  is  necessary  to  know  the  optimum 
flows.  Nevertheless,  the  foregoing  characterization  of  the  optimal 
solution  does  suggest  a  possible  iterative  procedure  to  find  the 
optimum  set  of  flows.  Starting  with  a  given  set  of  flows  x  one  can 
compute  the  minimum  derivative  shortest  paths  for  that  flow,  and 
hence,  a  new  flow,  x*(x)  that  is  positive  only  along  those  shortest 
paths.  The  process  can  then  be  repeated,  until  there  is  no 
appreciable  cost  decrease.  The  region  of  convergence  of  such  a 
procedure  can  be  improved  by  letting  the  new  flow  be  a  convex 
combination  of  x  and  x*(x),  i.e., 

xi+,  =  (l  -ctk)xk+akx*(xk). 

This  is  the  so-called  flow  deviation  method  of  Fratta,  Gerla,  and 
Kleii.'rock  [14],  where  0  <  ak  ^  1  is  chosen  to  minimize 

F{(\-ak)xk+ctkx*{xk)) 


which  is  a  special  case  of  the  feasible-direction  nonlinear 
programming  algorithm  due  to  Frank  and  Wolfe  [13].  The 
convergence  of  the  flow  deviation  method  to  the  optimum  routing 
is  rather  slow  because  unfavorable  paths  tend  to  carry  considera¬ 
ble  flow  during  many  iterations  unless  the  initial  routing  guess  is 
particularly  fortuitous.  Such  a  behavior  can  be  improved  by 
reducing  the  flow  along  each  nonminimum  derivative  path  in 
accordance  to  the  delay  experienced  in  that  path.  This  is  the  idea 
of  iterative  routing  algorithms  based  on  gradient  projection 
nonlinear  optimization  methods  (e.g.,  [4])  in  which  the  flow 
decrease  along  a  nonminimum  derivative  path  is  proportional  to 
the  difference  between  its  length  and  that  of  the  shortest  path 
(according  to  the  first  derivative  of  the  delay  function).  If  such  a 
decrease  would  result  in  a  negative  flow,  then  the  flow  along  that 
path  is  set  to  zero  (hence,  the  projection  to  the  set  of  feasible 
flows). 

We  have  seen  that  the  problem  of  static  network  routing  can  be 
formulated  as  a  conceptually  straightforward  optimization  prob¬ 
lem  that  admits  well-known  solutions  in  nonlinear  programming. 
What  sets  optimum  routing  in  communication  networks  apart 
from  other  multicommodity  flow  problems  arising  in  operations 
research  is  the  fact  that  the  optimization  is  carried  out  in  real 
time,  and  often,  in  distributed  fashion,  where  each  node  makes  its 
own  routing  decisions  based  on  local  information.  The  review  of 
centralized  routing  has  revealed  that  the  shortest  path  problem 
plays  a  central  role  in  solving  for  the  optimum  routing  regardless 
of  whether  the  link  congestion  measures  depend  on  the  link  flow 
or  not.  Hence,  we  will  start  the  exposition  of  distributed  routing 
algorithms  by  discussing  the  distributed  version  of  the  Bellman- 
Ford  shortest  path  algorithm. 

The  Bellman-Ford  updating  equation  in  (2.1)  suggests  that  the 
algorithm  is  suited  for  decentralized  operation  because  each  node 
can  update  its  own  estimate  of  distance  to  the  destination  (cost-to- 
go)  provided  it  receives  from  its  neighbors  their  own  estimates 
[appearing  on  the  right-hand  side  of  (2. 1)].  The  feature  that  makes 
the  study  of  the  distributed  Bellman-Ford  algorithm  interesting  is 
that  it  can  run  completely  asynchronously,  in  the  sense  that  the 
updating  and  communication  times  need  not  be  coordinated  and 
convergence  can  be  guaranteed  by  simply  assuming  that  updating 
and  communication  between  nodes  never  cease,  without  any 
requirements  whatsoever  on  the  rate  of  communication.  The  proof 
of  convergence  is  a  nice  illustration  of  the  analysis  of  decentral¬ 
ized  algorithms  where  the  processors  are  allowed  to  perform  their 
computations  and  to  communicate  the  corresponding  results 
completely  independently  of  one  another  [5],  [6].  The  idea  is  to 
show  that  the  estimates  computed  in  the  distributed  asynchronous 
algorithm  are  always  sandwiched  by  the  estimates  computed  by 
the  centralized  version  of  the  algorithm  when  started  at  two 
different  initial  conditions,  and  that  both  centralized  estimates 
converge  to  the  true  distances  to  the  destination  node. 

Those  centralized  estimates  are  denoted  by  Dk  =  (I>k(l),  •  •  • , 
Dk{N))  and  Dk  =  (Dk(  1),  •  •  •,  Dk{N)),  and  are  the  result  of 
the  centralized  Bellman-Ford  iteration  (2.1)  when  it  is  started 
with  initial  conditions  D0  =  («,  •  •  • ,  oo,  0)  and  D0  =  (0,  •  •  • , 
0),  respectively.  (The  destination  node  is  assumed  to  be  the  Afth 
node.)  Define  the  operator  [see  (2.1)] 

B,[Dk]=  nun  [DkU)  +  dy] 
jem 

-Dk+i(i)  (2.6) 

tfl  £  i  <  N,  and  BN[Dk]  =  Dk(N).  This  operator  is  monotone 
in  the  sense  that  if  D  <  D*  (i.e.,  if  D(i)  <,  D*(i),  i  =  1,  •  •  ■ 
N),  then 

BAD]  (2.7) 

The  monotonicity  of  B\  implies  that 

(2.8) 


t 


* 

1 


I 


pk£Dk+i^Dk+i£Dk 
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and,  moreover,  it  is  easy  to  show  that  for  sufficiently  large  k 


Qk  —  Av- 1  =  Ok 


(2.9) 


which  is  the  vector  of  distances  from  each  node  to  destination  as 
we  saw  in  the  discussion  of  the  centralized  algorithm. 

In  the  asynchronous  distributed  version  of  the  algorithm,  it  is 
assumed. that  each  node  /  keeps  at  time  /  £  0  an  estimate  of  its 
distance  to  destination  A,(i),  and  an  estimate  of  the  distance  from 
each  of  its  neighbors  j  €  N(i)  to  destination  A\(j ),  which  is 
simply  the  latest  estimate  received  from  nodey.  In  view  of  (2.8) 
and  (2.9),  convergence  of  the  algorithm  will  follow  if  we  show 
that  for  every  index  k,  there  exists  a  time  tk>  0  such  that  for  all  / 
&  tk 

PkSA.zDk  (2.10) 

and  for  i  =  1 ,  •  •  • ,  N  -  1 

DkU)*A‘U)*OkU)  jeN(i).  (2.11) 

This  is  shown  by  induction.  If  k  =  0,  then  (2. 10)  and  (2. 1 1)  hold 
as  long  as  the  initial  estimates  of  the  decentralized  algorithm  are 
nonnegative.  Assuming  that  the  induction  hypothesis  is  true  for  k , 
the  monotonicity  of  B,  implies  that  if  t  >  4,  then 


pk+i(i)  =  BilPk]  ^BAA  J]  <; B,[Ok )  =  Ok+ 1  (i).  (2. 12) 

But  A,(i)  is  a  piecewise  constant  function  of  time  which  only 
jumps  at  the  updating  times  of  node  /,  at  which  times  it  takes  the 
value 


Fig.  4.  Queueing  model  of  a  node  with  one  incoming  link  and  two  outgoing 

links. 


than  the  path  flows.  Tsitsiklis  and  Bertsekas  [42]  showed  the 
convergence  of  the  distributed  asynchronous  implementation  of 
gradient  projection  optimal  routing  algorithms  provided  the  time 
between  consecutive  broadcasts  is  small  enough  relative  to  the 
speed  at  which  the  flows  generated  by  the  algorithm  change.  The 
approach  for  showing  the  stability  of  this  algorithm  is  very 
different  from  the  proof  of  convergence  of  the  distributed 
Bellman-Ford  algorithm  where  the  monotonicity  of  the  dynamic 
programming  mapping  implied  that  the  estimates  are  closer  and 
closer  to  the  solution  regardless  of  the  actual  sequence  of 
communication  and  computation  times.  The  idea  here  is  that  if  the 
step  size  of  the  algorithm  is  small  enough,  then  the  flows  change 
so  slowly  with  respect  to  the  periods  between  communication 
times  that  their  evolution  is  very  close  to  that  of  the  centralized 
algorithm  which  uses  the  unique,  true  value  of  each  link  flow. 

B.  Dynamic  Routing 


A,0)  =  Bl{A‘,]. 

Therefore,  we  can  write 

Z>*+1(/)sMi(i)s;At+i(/)  for  t^tk(i)  (2.13) 

where  tk(i)  is  the  smallest  updating  time  of  node  /  which  is  greater 
than  tk.  Moreover,  if  we  wait  long  enough  after  max,  4(0*  not 
only  ail  the  nodes  will  have  carried  out  their  first  updates  after  tk 
but  the  result  of  those  computations  will  have  been  communicated 
to  their  neighbors  because  of  the  assumption  that  updating  and 
communication  occur  infinitely  often.  Hence,  there  exists  4+i  S 
max;  tk(i)  such  that  for  aii  t  2:  r*+,  and  for  all  /  and  j 

A\U)=A,(j) 

for  some  s  >  tk(j)  (which  depends  on  t,  i,  and  j).  Thus,  it 
follows  from  (2.13)  that 

Dk+iU)*A'tU)£Dk+lU)  j  €  N(i)  /=1,  •*',  N-l 

completing  the  induction  proof  and,  therefore,  the  proof  of 
convergence  of  the  distributed  asynchronous  Bellman-Ford  al¬ 
gorithm. 

When  the  link  delays  depend  on  the  traffic  flows,  it  is  also 
possible  to  obtain  the  optimal  routing  that  solves  (2.2)  in  a 
distributed  asynchronous  fashion.  Gradient  projection  algorithms 
are  better  suited  for  this  task  than  the  flow  deviation  method 
because  in  the  latter  method  a  higher  degree  of  synchronization  is 
required  in  order  for  the  nodes  to  use  the  same  step  size  at  each 
iteration.  In  the  distributed  asynchronous  implementation  of 
gradient  projection  optimum  routing  algorithms,  each  node 
broadcasts  from  time  to  time  the  values  of  its  outgoing  flows  to  its 
upstream  neighbors,  who  in  mm  pass  that  information  on  to  their 
upstream  neighbors.  In  this  way,  the  source  keeps  estimates  at  all 
times  of  the  link  flows  and  can  cany  out  the  gradient  projection 
iteration  autonomously  based  on  those  estimates.  The  first 
algorithm  based  on  this  idea  was  due  to  Gallager  [16],  who  posed 
an  alternative  formulation  to  (2.2),  where  the  unknowns  are  the 
fractions  of  flow  routed  to  each  outgoing  link  at  each  node,  rather 


As  mentioned  earlier,  there  are  two  fundamentally  different 
philosophies  to  network  routing:  eithei  viewing  it  as  a  “flow” 
problem  in  which  the  traffic  of  messages  is  modeled  as  a 
“macro”-commodity  entering  the  network  as  a  single  entity 
(static  or  quasi-static  routing),  or  as  an  individualized-raessage 
path-finding  problem  in  which  the  traffic  is  broken  down  to  its 
constituent  elementaiy  units  (dynamic  routing)— a  dichotomy  akin 
to  that  of  statistical/quantum  mechanics  in  physics.  Whereas  the 
first  approach  leads  to  optimization  problems  where  time  plays  do 
role,,  the  essential  ingredient  of  the  second  approach  is  the 
randomness  of  the  time-evolution  of  the  buffers  in  the  network, 
thus  placing  dynamic  routing  within  the  sphere  of  stochastic 
control. 

The  most  elementary  instance  of  dynamic  routing  is  die  simple 
queueing  system  shown  in  Fig.  4  which  models  a  node  with  one 
incoming  link  and  two  outgoing  links.  It  simplifies  considerably 
the  dynamics  of  the  message  arrival  process  and  of  the  service 
time  characteristics  and  ignores  processing  delay.  Thus,  the 
arrival  instants  of  messages  over  the  incoming  link  are  assumed  to 
constitute  a  Poisson  process  of  constant  rate  X.  Upon  arrival  each 
message  is  put  in  the  buffer  of  one  of  the  two  outgoing  links.  This 
action  represents  the  “control.”  The  buffers  are  assumed  to  have 
unlimited  (infinite)  capacity  and  the  message  lengths  are  assumed 
to  be  random  with  exponential  distribution  (an  obvious  additional 
simplification)  with  parameter  /<.  The  two  outgoing  links  have 
equal  capacity  of  C  bits/s.  Thus,  each  link  is  modeled  as  a 
queueing  system  with  exponential  service  time  distribution  with 
parameter  pC.  It  is  desired  to  characterize  the  optimal  control 
policy  that  minimizes  the  average  total  delay  per  message  based 
on  the  observations  of  the  “state”  of  the  system,  namely  the 
number  of  messages  <71  and  q2  in  the  two  buffers.  The  model,  of 
course,  assumes  that  the  head-of-the-iine  message  is  dropped  from 
the  buffer  as  soon  as  the  transmission  of  its  last  bit  is  completed. 

This  model,  despite  its  simplicity,  proved  to  be  rather  difficult 
to  analyze.  For  details,  see  [10];  it  is  not  important  to  repeat  them 
here.  It  should  suffice  to  state  that  the  main  result,  which  simply 
requires  that  upon  arrival  a  message  should  join  the  shortest  queue 
(with  arbitrary  decision  in  case  the  two  queues  have  equal 
numbers  of  messages),  was  hardly  surprising.  Yet  an  intricate 
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argument  on  the  dynamic  programming  equation  (DPE)  was 
needed  and  there  were  some  counter-intuitive  side-results  includ¬ 
ing  the  relaxation  of  the  Poisson  assumption  on  the  arrivals,  and 
the  fact  that  in  the  incomplete  state  information  case,  the 
certainty-equivalent  control  (i.e.,  send  the  message  to  the 
expected  shortest  queue)  need  not  be  optimum  unless  both  queues 
have  the  same  number  of  customers  initially. 

The  optimality  of  the  send-to-shortest-queue  (SS)  policy  in  the 
complete  state  information  case  can  be  proved  in  a  rather  strong 
sense.  At  all  times,  the  sum  (qt  +  q2)  and  maximum  (max  {<7,, 
<7:})  of  the  number  of  messages  in  both  buffers  are  stochastically 
minimized  by  the  SS  policy  in  the  sense  of  the  partial  order 
between  random  variables  according  to  which  the  random  variable 
X  is  stochastically  smaller  than  yif  P[X  <  a)  s  P[Y  s  a]  for 
all  a.  The  proof  of  optimality  can  be  obtained  by  the  method  of 
forward  induction  [53],  whereby  the  desired  stochastic  ordering 
between  the  queue  sizes  under  the  optimum  and  an  arbitrary 
policy  is  shown  to  be  preserved  at  each  transition. 

The  problem  formulation  of  [10]  is  one  of  many  related  ones 
(see  [8],  [9],  [22],  [24],  [33],  [38],  [54],  [55])  which  are  slightly 
more  complicated  but  share  some  fundamental  characteristics 
which,  in  fact,  extend  beyond  the  confines  of  the  routing  problem 
into  the  areas  of  priority  assignment,  resource  allocation,  and  flow 
control.  They  are  all  Markovian  decision  process  (MDP)  prob¬ 
lems.  In  the  sequel  we  will  describe  a  fairly  general  MDP  that 
includes  the  dynamic  routing  problem  as  a  special  case.  In  fact,  it 
includes  almost  all  of  the  queueing  control  problems  that  have 
been  studied  in-connection  with  communication  network  issues. 
We  will  then  oudine  the  solution  methodologies  that  have  been 
used.  These  include  basically:  1)  die  derivation  of  optimality 
conditions  from  the  DPE  associated  with  the  corresponding  MDP; 
2)  the  use  of  sample  path  stochastic  dominance  arguments,  and 
finally;  3)  the  reformuladon  of  the  MDP  as  a  linear  program.  We 
should  emphasize,  lest  the  reader  be  unduly  encouraged,  that  the 
problems  in  this  area  are  sufficiently  complex,  so  that  only  modest 
results  can  be  generally  obtained  despite  involved  arguments  and 
nontrivial  machinery.  Typically,  these  results  characterize  some 
structural  properties  of  the  optimal  policy.  However,  knowledge 
of  such  structure  is  often  sufficient  to  permit  close  approximation 
of  the  actual  optimal  policy  by  well-founded  heuristics. 

Let  us  recall  briefly  what  an  MDP  is  (for  details,  see  [30]).  We 
need  a  state  description  of  the  process  to  be  controlled.  Let  S  be  its 
state  space.  When  in  state  s  €  S,  a  set  A ,  of  admissible  control 
actions  is  specified.  When  action  a  €  A,  is  applied,  there  is  a 
transition  from  state  s  to  s'  that  is  governed  by  the  probability 
distribution  p(s’  |s,  a),  and  which  occurs  after  a  random  time  r 
which  is  exponentially  distributed  with  distribution  denoted  by 
1(t\s,  a,  s').  Clearly,  p  and  t  together  describe  the  stochastic 
dynamics  of  the  process  to  be  controlled.  Finally,  each  transition 
is  accompanied  by  a  cost  penalty  that  we  denote  by  c(r,  s,  a,  s' ). 

The  dynamic  routing  problem  we  considered  before  fits  in  this 
formulation  easily.  In  that  case,  the  state  space  is  S  =  {0,  1,  2,  3, 
•  •  •  }2.  An  element  s  =  {qlt  q2)  £  S  is  simply  the  pair  of  values 
of  the  respective  queue  sizes.  The  set  of  actions  A,  is  the  same  for 
any  state  and  consists  of  j,  and  a2  where  at  is  the  action  that 
assigns  an  arriving  message  to  the  buffer  of  link  /.  The 
distribution  p  is  of  trivial  form,  in  that  the  transitions  are 
deterministic.  Assignment  of  an  arrival  to  queue  <  augments  qt  by 
one.  Note,  now,  that  in  addition  to  the  arrival  instants,  the 
departure  (or  service  completion)  instants  are  important  because 
they  induce  state  transitions  as  well.  A  departure  from  queue  i 
reduces  Qi  by  one.  When  a  departure  occurs  there  is  no 
meaningful  control  action  that  can  be  applied  in  this  particular 
problem.  The  exponential  distribution  t  corresponds  to  times 
between  arrivals  and/or  departures.2  Finally,  the  cost  rate  c  must 


1  A  slight  modification  of  the  model  of  transitions,  called  uniformization,  is 
useful  in  that  it  introduces  dummy  transitions  from  a  state  into  itself:  thus, 
some  situations  which  introduce  nonessential  complications  can  be  handled 
without  departure  from  this  discrete  transition  time  formulation. 


reflect  the  delay.  By  Little’s  result  in  queueing  theory,  we  know 
that  the  average  delay  is  proportional  to  the  average  number  of 
customers  in  the  queue.  Thus,  c(r,  a,  s,  s')  can  be  taken  to  be 
simply  equal  to  (7i  +  q2).  This  MDP  formulation  cr  .be  extended 
to  encompass  more  complicated  queueing  control  problems. 

Let  us  return  now  to  the  general  MDP.  We  need  to  specify  the 
notion  of  a  control  policy  and  the  optimization  criterion.  Let  us 
denote  by  £2,  •  •  • ,  the  state  transitions  that  occur  at  instants 
t2,  •  •  *.  A  policy  x  is  a  sequence  of  decision  rules  Tj,  r2,  •  •  •, 
where  r„  determines  the  choice  of  action  at  the  transition  time  t„. 
It  can  be  viewed  as  a  conditional  distribution  on  the  set  of  actions 
parametrized  by  the  past  history  of  the  process. 

The  optimization  criterion  that  corresponds  to  the  practical  case 
of  expected  total  delay  is  the  long-run  average  expected  cost; 
namely,  if  we  denote  by  V(x,  i,  t)  the  expected  cost  incurred 
under  policy  x,  with  initial  state  /,  until  time  t  we  consider  as  the 
optimization  criterion  the  value  function 

V(x,  i)  £  lim  inf  — — '—)  . 

t-*CD  * 

For  technical  reasons,  however,  that  are  well  known  to  optimiza¬ 
tion  specialists,  it  is  easier  to  establish  optimality  conditions  if  we 
consider,  instead,  the  so-called  a -discounted  cost,  i.e., 

V°(x,  /)-("  e~al  dV(x,  i,  t). 

■'f-0 

The  latter  converges  to  the  former  as  a  -*  1  under  a  variety  of 
stationarity  conditions.  For  technical  reasons  that  will  become 
apparent  in  the  sequel,  we  will  also  consider  the  finite-horizon 
costs.  These  are  defined  in  a  similar  fashion  except  that  we  let 
time  extend  only  to  /„,  the  instant  of  the  nth  transition.  If  we 
denote  by  Va(i)  andK(i)  (and  also  K“(i),  V„(i)  for  the  finite 
horizon  cases)  the  values  of  these  cost  functions  when  x  is  chosen 
optimally,  we  are  led  to  the  following  DPE: 

Va(i)=  inf  2  [cU,  a>  /')  +  £(/,  a,  i')Va{i')\p{i' \a,  i) 


or 


r;+,(o«  inf  S  i«<*. a’  n+w, o 


oS  A 


i  r 


where 


P(s,  a,  s')  if  e-° 
Jo 


dt{r\s,  a,  s') 


and 


!00 

c(t,  s,  a,  s')  dt  (r|s,  a,  s') 
o 


are  the  discount  factor  and  cost  values  per  transition,  respectively. 

The  DPE  is  of  fundamental  importance  in  the  study  of  MDP’s 
because  the  value  function  Va  has  the  usually  convenient 
properties  of  convexity,  supermodularity,  and  other  forms  of 
monotonicity  that  lead  readily  to  sufficient  conditions  for  optimal¬ 
ity.  The  difficulty  with  the  analysis  of  the  DPE  is  that  the 
optimality  conditions  are  heavily  problem-dependent  and  often 
lead  to  explosively  large  numbers  of  cases  to  be  verified 
separately.  This  is  especially  true  for  MDP’s  that  arise  from 
queueing  models.  For  this  reason,  and  because  of  additional 
difficulties  that  arise  when  the  state  is  on  the  boundaries  (see 
[22]),  it  became  evident  that  alternative  methods  of  solution  were 
needed. 
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One  alternative  method  that  has  received  attention  recently  and 
which  produced  successful  results  in  problems  of  queueing  control 
(akin  to  the  routing  problem)  is  a  probabilistic  method  called 
sample-path  or  stochastic  dominance.  This  method  bypasses 
completely  dealing  with  the  value  function.  Instead,  it  focuses 
directly  on  seeking  the  optimal  policy.  Let  G  be  the  class  of 
admissible  policies.  If  we  suspect  that  the  optimal  policy  x  has  a 
property- p ,  then  we  can  proceed  as  follows  in  order  to  prove  that 
it  actually  does  have  that  property.  Let  S  be  a  subset  of  G,  to 
which  we  know  the  optimal  policy  belongs.  We  consider  a  subset 
of  policies  Sp  C  S,  all  elements  of  which  have  the  property  p.  For 
every  x  (£  Sp,  we  attempt  to  construct  a  policy  x  which 
outperforms  x.  If  we  succeed,  we  must  conclude  that  the  optimal 
policy  belongs  to  Sp.  In  constructing  x  we  often  need  to  engage  in 
a  careful  reorganization  of  the  underlying  probability  space  in 
order  to  align  the  sample  paths  properly,  so  that  the  comparison  of 
the  two  policies  can  be  made  for  every  sample  path.  This 
procedure  is  full  of  risks  and  extreme  care  is  required  to  avoid 
faulty  arguments.  Note,  also,  that  to  apply  this  method  usefully, 
we  must  have  “guessed”  the  properties  of  the  optimal  policy 
correctly.  Thus,  at  best,  it  is  a  method  to  verify  the  validity  of  our 
conclusions,  rather  than  a  method  that  leads  us  to  the  right 
conclusions. 

Successful  use  of  the  stochastic  dominance  approach  was  made 
in  [52]  and  [50]  where  a  problem  that  is  dual  to  the  problem  of 
dynamic  routing  was  studied.  Specifically,  in  a  two-server 
queueing  system  in  which  the  two  servers  have  unequal  service 
rates,  we  wish  to  determine  whether  and  when  the  slower  server 
needs  to  be  activated  if  we  are  interested  in  minimizing  the  usual 
total  expected  delay  function.  That  the  optimal  policy  has  a 
threshold  form  (namely  that  the  slower  server  must  be  activated 
when  the  queue  size  exceeds  a  crucial  value)  was  proven  in  [29] 
via  the  DPE  method.  However,  the  alternative  proof  via  the 
arguments  of  stochastic  dominance  was  much  simpler  and  led  to  a 
generalization  of  the  result  to  cases  of  nonexponential  arrivals 
and/or  service,  that  could  not  have  been  easily  accomplished  by 
means  of  the  DPE  method. 

Another  successful  use  of  the  stochastic  dominance  method  has 
been  noted  in  [2].  In  this  case  ihe  problem  of  optimally  choosing 
which  customer  to  serve  next  in  a  single  queueing  system  was 
considered  under  the  constraint  that  each  customer  must  begin  (or 
terminate)  service  by  an  individually  assigned  random  deadline  or 
else  it  is  dropped  from  the  system.  The  cost  criterion  is  then  to 
minimize  the  expected  number  of  lost  customers.  It  was  proven 
that  scheduling  the  customer  with  shortest  time  to  extinction 
minimizes  this  cost. 

Although  these  problems  differ  from  routing,  the  model 
structures  are  quite  similar,  and  it  has  been  observed  that,  usually, 
queueing  control  problems  with  such  structural  similarities  can  be 
studied  equally  successfully. 

The  third  method,  which  was  first  used  in  [38]  in  the  study  of  a 
specific  queueing  control  problem,  and  which  has  been  broadly 
extended  recently  in  [51],  is  the  linear  programming  approach. 
Almost  any  queueing  control  problem  that  can  be  formulated  as  a 
MDP  (therefore  the  problem  of  dynamic  routing,  as  well)  can  be 
converted  to  an  equivalent  linear  program  (LP).  The  advantages 
of  this  conversion  are  that  it  is  problem-independent  and  it  leads 
occasionally  to  successful  study  of  semi-Markov  decision  prob¬ 
lems  as  well.  Furthermore,  it  facilitates  considerably  the  charac¬ 
terization  of  optimal  solution  properties.  Here  is  how  this 
equivalence  can  be  demonstrated. 

Let  us  concentrate  on  an  MDP  under  a  finite-horizon,  dis¬ 
counted  cost  formulation.3  We  shall  consider  a  queueing  model 
with  state  dynamics  given  by 

xk*\  =xk+ £*+iZ*+ 1 . 


Here,  xt  denotes  the  state  at  tk  (the  instant  of  the  Arth  transition),  £* 
represents  that  transition,  and  z*  represents  the  control  action  at 
that  transition.  The  transition  £*  can  represent  an  arrival  or  a 
departure  as  an  increment  of  the  state.  The  control  z*  is 
conveniently  defined  to  enable  (z*  =  1)  or  disable  (z*  =  0)  a 
transition.  For  example,  in  the  routing  model  discussed  at  the 
beginning  of  the  section,  the  state  is  equal  to  a  two-dimensional 
vector  of  queue  sizes,  and  the  transition  corresponding  to  sending 
an  arriving  message  to  the  first  queue  would  be  represented  by  £* 
=  [1  0]r..lndeed,  a  variety  of  queueing  control  problems  (in  fact, 
the  vast  majority  of  those  that  have  been  considered  in  connection 
with  communication  network  problems)  can  be  so  represented. 

Note  that  the  crucial  aspect  of  this  state  equation  is  the  linear 
dependence  on  the  controls.  Note  also  that  usually  the  cost 
function  is  linear  in  the  state  (since  the  usual  cost  criterion  is  the 
expected  delay  which  is  coupled  to  the  queue  sizes,  and  hence  the 
state,  by  Little’s  result).  Consequently,  the  cost  is  linear  in  the 
controls.  The  minimization  of  the  cost  over  the  set  of  control 
trajectories  is  constrained  since  the  state  equation  must  be  satisfied 
and  the  state  must  always  belong  to  an  admissible  set  (typically,  a 
set  of  vectors  with  integer-valued  coordinates  belonging  to  given 
ranges).  Thus,  the  constraints  are  also  linear  in  the  controls,  and 
the  problem  is  easily  formulated  as  an  LP.  There  are,  however, 
two  points  that  require  attention.  First,  the  controls  are  integer¬ 
valued,  i.e.,  z*  S  {0,  1}.  Second,  in  the  MDP  the  vectors  £*  are 
random  and  depend  on  past  history. 

The  first  problem  is  taken  care  of  in  one  of  two  ways:  by 
construction  or  by  use  of  a  property  of  the  constraint  matrix  of  the 
linear  program,  called  unimodularity.  The  construction  method 
involves  using  a  noninteger  optimum  control  whose  quantized 
version  satisfies  the  MDP  optimality  conditions  (see  [38],  [5 1  ]  for 
details).  The  use  of  unimodularity  involves  a  well-known  result  in 
the  theory  of  integer  linear  programming  (e.g.,  [34]):  if  the 
constraint  matrix  of  an  LP  is  integer-valued  and  totally  unimodu- 
lar  (i.e.,  each  of  its  sub-determinants  is  +1,  -  1,  or  0),  then  all 
the  vertices  of  the  feasible  polytope  are  integer-valued.  Therefore, 
no  further  restrictions  are  needed  to  guarantee  that  the  solution  of 
a  conventional  LP  will  result  in  the  integer-valued  optimal 
control.  Fortunately,  in  many  queueing  problems  of  interest 
(including  the  dynamic  routing  problem),  the  constraint  matrix  is 
indeed  totally  unimodular. 

The  second  problem  is  easily  taken  care  of  by  thinking  of  the 
Zk  s  as  functions  from  the  sample  space  fi  to  the  action  space. 
Thus,  the  cost  criterion  can  be  written  as  a  functional  on  the 
underlying  probability  space. 

Let  z*(u*)  represent  the  control  action  at  the  Arth  transition, 
where  w*  denotes  the  random  “history”  until  the  Ath  transition. 
We  have 

**+i(«*+i)=-,ft(w*)  +  z*+i(«*+i)£*+i(M*+i)- 

Let  S  and  Z  be  the  set  of  admissible  states  and  controls, 
respectively.  The  /3 -discounted,  n -step,  expected  cost  under 
policy  z  and  initial  condition  x  is  given  by 

z)  =  Ez'!£  fi*L(zt) 

k-o 

where 

L(Zk)=crxk  +  dTzk 

(c  and  d  denote  constant  column  vectors).  This  is  a  cost  function 
that  is  adequately  general.  For  example,  in  a  pure  resource 
allocation  problem  without  blocking  or  rejection  of  messages  we 
have  d  =  0,  while  in  pure  blocking  problems  we  take  c  =  0.  The 
state  equation,  after  repeated  iterations,  yields 


5  The  reason  that  we  cannot  work  directly  with  infinite  horizons  is  the 
possibility  of  so-called  duality  gaps  in  linear  programming  theory  with 
infinite-dimensional  variables. 


k 

Xki^k)  —X+  ^ 
/-I 


k>  0. 
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Therefore, 

Jsn(x,  z)  =  Ex'£  0k  \cTx+c7"2  z1$j+dTzk] 

*- 0  V  Jn  I  j 

-~r~^  crx+Ex  £  0*  f  S  • 

1_P  *-i  M-i  ' 

But 

£,(z*)  =  S  ZkMPrM' 

uk 

Hence 

./£(*,  z)  =  -  — |r  ^7*+  S  S  T*(w*)^(“*) 

*~P  *- 1  a* 

where  7*(a>*)  is  a  known  function  that  depends  on  Pr{uk),  c,  £*, 
and  0k.  Consequently,  the  MDP  is  equivalent  to 

n 

min  J)  yk(o>k)Zk(uk) 

!k  *-l  u*. 

subject  to 

(x+  J  z;(w/)f;(«/)^  6  5 

which  is  a  conventional  LP  where  the  initial  condition  plays  the 
role  of  a  parameter,  the  sensitivity  with  respect  to  which  can  be 
studied  by  the  well-developed  theory  of  sensitivity  analysis  of 
linear  programming  [15], 

In  conclusion,  we  see  that  the  MDP  is  converted  to  an 
equivalent  LP  under  very  mild  conditions  that  are  usually  satisfied 
by  dynamic  routing  and  other  queueing  control  problems.  Thus,  a 
third  alternative  methodology  becomes  generally  available  for  the 
study  of  these  problems.  Whether  to  choose  from  the  arsenal  the 
DPE  approach,  or  the  LP  method,  or  stochastic  dominance  tools, 
depends  on  the  problem  and  on  the,  as  yet  undeveloped,  intuition 
that  the  investigator  should  possess. 

HI.  Multiple-Access  Communications 

The  communication  networks  considered  in  the  discussion  of 
routing  problems  in  Section  II  consist  typically  of  a  set  of  nodes 
connected  by  point-to-point  communication  links.  Each  of  these 
links  viewed  in  isolation  can  be  modeled  as  a  classical  communi¬ 
cation  channel  with  one  sender  and  one  receiver.  In  this  section, 
we  consider  multipoint-to-point  communication  links  where  sev¬ 
eral  transmitters  share  a  common  channel.  Multiple-access 
channels  are  the  basic  building  blocks  of  radio  networks,  satellite 
communication,  and  local  area  networks,  and  during  the  last  15 
years  have  attracted  the  attention  of  many  communication, 
information,  and  control  theorists. 

There  is  a  wide  variety  of  strategies  to  divide  the  “resources” 
of  a  communication  channel  among  several  geographically  dis¬ 
persed  transmitters.  The  simplest  methods  are  those  that  assign  a 
permanent  independent  sub-channel  to  each  transmitter  (e.g.,  in 
frequency  division  multiple  access  and  time  division  multiple 
access);  these  strategies  are  easy  to  analyze  and  are  widely  used  in 
practice  in  situations  where  the  users  need  to  transmit  at  fairly 
steady  rates.  If  the  transmitters  are  bursty  (i.e.,  the  radio  of  peak- 
to-average  rate  at  which  the  need  to  transmit  is  high)  those  static 
methods  are  inefficient  since  most  of  the  time  the  channel  is 
underutilized  while  demand  (and  induced  delay)  accumulates  at 
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busy  terminal  locations.  Dynamic  channel  sharing  strategics 
overcome  this  problem  by  allocating  channel  resources  on  an  on- 
demand  basis.  Consistent  with  the  overall  spirit  of  this  paper,  our 
goal  here  is  not  to  review  this  vast  topic,  but  rather  to  demonstrate 
how  control  theory  can  play  a  useful  role  in  its  study.  Here  we 
wish  to  single  out  two  multiple  access  strategies:  random  access 
and  simultaneous  transmission,  which  are  broadly  representative 
of  dynamic  channel  sharing  systems  and  in  which  control  theoretic 
concepts  have  played  a  pivotal  role. 

In  random  access  communication,  the  conceptual  allocation 
model  is  addressed  without  an  effort  to  exploit  the  signaling 
degrees  of  freedom  and  the  micro-structure  of  the  transmitted 
messages.  For  this  purpose,  a  crude  channel  model  is  considered, 
that  achieves  this  separation  of  the  “macro”  from  the  “micro” 
problem.  In  simultaneous  transmission  systems,  however,  a  more 
refined  viewpoint  is  adopted,  by  taking  the  realities  of  the  medium 
into  account,  modeling  them,  and  exploiting  them. 

A.  Random-Access 

The  object  of  interest  here  is  the  so-called  collision  channel 
model,  in  which  messages  (called  packets)  require  one  time  unit 
(called  slot)  for  transmission  and  are  sent  by  a  population  of  users 
who  are  synchronized  so  that  their  slots  coincide  at  the  receiver, 
but  are  otherwise  uncoordinated  and  unaware  of  which  and  how 
many  users  have  packets  to  transmit.  If  two  or  more  packets  are 
simultaneously  transmitted,  it  is  assumed  that  the  receiver  is 
unable  to  recover  any  of  the  messages,  and  they  have  to  be 
retransmitted  in  a  future  slot.  In  the  ALOHA  algorithm,  which 
was  developed  in  the  early  1970’s  [1]  at  the  University  of  Hawaii 
and  marked  the  beginning  of  the  area  of  random-access  communi¬ 
cation,  each  packet  that  has  been  unsuccessfully  transmitted 
before  is  transmitted  with  probability  p  in  the  next  slot.  New 
packets  which  have  not  attempted  transmission  before  are 
transmitted  with  probability  either  1  or  p  depending  on  which 
version  of  the  ALOHA  algorithm  is  used.  In  our  discussion,  we 
will  assume  the  latter  choice. 

Under  these  conditions,  and  assuming  that  the  number  of  newly 
generated  packets  in  each  slot  is  a  random  variable  (with  mean  X) 
independent  from  slot  to  slot,  the  number  of  packets  awaiting 
transmission  (called  backlog)  is  a  Markov  chain  taking  values  in 
{0,  1,  2,  •  •  • }.  The  central  problem  is  to  investigate  under  what 
conditions  the  backlog  Markov  chain  is  ergodic,  i.e.,  it  is  stable  in 
the  sense  that  it  reaches  a  steady  state  in  which  the  periods 
between  the  times  when  there  are  no  packets  to  transmit  are  not 
tcx>  infrequent  (they  have  finite  expected  value).  The  transition 
probabilities  of  the  Markov  chain  are  parametrized  by  the  rate  of 
arrival  of  new  packets  X  and  the  retransmission  probability  p. 
Whereas  X  is  fixed  and  given,  p  is  chosen  by  the  transmitters. 
Hence,  we  are  dealing  with  a  fairly  simple  controlled  Markov 
chain  whose  control  space  is  the  interval  (0,  1],  In  the  original 
ALOHA  algorithm,  the  control  p  remained  constant  and  common 
to  all  transmitters  regardless  of  the  information  acquired  by 
listening  to  the  channel,  thereby  resulting  in  the  open-loop  control 
of  the  Markov  chain.  Depite  several  “proofs”  of  the  stabilitiy  of 
ALOHA  published  during  the  1970’s,  neither  the  actual  system 
built  in  Hawaii  nor  the  ideal  Markov  chain  model  were  stable.  The 
reason  why  the  open-loop  system  is  unstable  can  be  easily 
understood  by  considering  the  backlog  drift,  d(n),  which  is 
defined  as  the  expected  increase  in  the  backlog  over  the  next  slot 
when  the  current  value  of  the  backlog  is  equal  to  n.  It  is  easy  to  see 
that  the  backlog  drift  is  given  simply  by  the  expected  number  of 
new  packets  per  slot  minus  the  expected  number  of  successfully 
transmitted  packets  in  the  next  slot,  i.e., 

d(«)=X-[«p(l  -/>)•-'].  (3.1) 

The  drift  quantifies  the  expected  evolution  of  the  Markov  chain 
from  each  "state,  and  therefore  it  is  a  valuable  tool  in  analyzing  the 
stability  of  the  chain.  For  any  p  €  (0,  lj  the  term  in  brackets  in 
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(3.1)  goes  to  0  as  n  -*  oo,  and  hence,  the  drift  is  positive  and  dose 
to  X  for  sufficiently  large  backlogs.  This  implies  that  when  the 
backlog  is  large  it  tends  to  grow,  thereby  eliminating  any  hope  for 
stability.  Using  standard  results,  this  reasoning  can  be  formalized 
straightforwardly  to  prove  not  only  the  instability  of  the  open-loop 
system  [1 1 1  for  all  values  of  X  and  p ,  but  the  fact  that  the  backlog 
goes  to  infinity  with  probability  one  [25],  [35],  [40], 

Fortunately,  the  system  can  be  stabilized  by  closed-loop 
control.  Let  Us  examine  first  the  case  of  complete-state  informa¬ 
tion,  i.e.,  each  station  is  informed  at  the  end  of  each  slot  of  the 
current  value  of  the  backlog  and  chooses  the  retransmission 
probability  on  the  basis  of  that  information.  As  far  as  stability  is 
concerned,  the  best  choice  of  the  retransmission  probability  p  is 
the  value  that  minimizes  the  drift  because  that  results  in  the 
maximum  possible  arrival  rate  that  guarantees  stability  (called  the 
throughput).  It  follows  from  (3.1)  that  the  optimum  value  of  p  is 

P*ln)=~,  n=  1,2,  (3.2) 

n 

and  the  resulting  drift  is 

1  "I «-■ 

(3.3) 

which  is  negative  for  n  >  1  when  X  <  e~ 1 ,  and  is  positive  for 
large  backlogs  when  X  >  e~'.  Therefore,  the  throughput  of  the 
closed-lojp  system  with  complete  state  information  is  e'1  = 
0.368.  However,  the  relevance  of  complete  state  information 
feedback  is  rather  limited  in  practice.  This  is  because  the 
instantaneous  value  of  the  backlog  is  available  to  each  station  only 
if  there  exists  so  large  a  degree  of  communication  among  the 
transmitters  that  much  more  efficient  algorithms  than  ALOHA 
can  be  used. 

The  case  of  partial  state  information  is  the  problem  of  interest  in 
practice,  since  the  only  feedback  available  to  each  station  is  the 
outcome  (collision,  success,  empty)  of  the  transmission  in  each 
slot.  The  analysis  of  the  controlled  system  with  partial  state 
information  was  pioneered  by  Hajek  and  Van  Loon  [20]  who 
proposed  a  recursive  updating  law  of  the  retransmission  probabili¬ 
ties  as  a  function  of  the  channel  outcomes.  This  feedback  policy 
was  shown  in  [21]  to  attain  the  throughput  achievable  with 
complete-state  information,  namely  e~l.  Those  papers  and  subse¬ 
quent  works  have  referred  to  the  problem  as  decentralized 
control  of  ALOHA,  motivated  by  the  fact  that  each  station 
chooses  the  retransmission  probability  autonomously  based  on  the 
channel  feedb"'’  ’owever,  it  is  useful  to  recognize  that  the 
problem  boi’  to  (centralized)  stochastic  control  with  one 

decision  rr  .  and  incomplete  state  information  because  all 
stations  are  constrained  to  use  the  same  retransmission  probabili¬ 
ties. 

We  will  review  here  the  proof  of  stability  of  the  following 
certainty-equivalence  closed-loop  control: 

P(n)=~  (3.4) 

where  h  is  an  estimate  of  the  backlog  updated  according  to 

max  {1,  n*+a}  kth  slot  is  idle 
/?*  +  / 3  &th  slot  is  success  or  collision. 


BACKLOG 

Fig.  5.  Drift  of  (backlog,  backlog  estimate)  Markov  process  for  decentral¬ 
ized  control  with  a  =  -  1.48,  0  =  0.8,  and  A  =  0.33. 


«*)}*  (rather  than  the  backlog  itself)  which  is  a  Markov  process. 
According  to  (3.4)  and  (3.5)  the  drift  of  this  Markov  process  is 
given  by 

£[(n*+i.  d*+i)-(n*,  nk)\(nk,  nk)  =  {n,  s)] 

“  [*”5]  .  0+(«»ax  {a,  i-s}-jS) 

=  (d(n,  s),  c(/i,  s)).  (3.6) 

Contrary  to  what  we  saw  in  the  case  when  the  state  is  known,  it  is 
not  true  that  die  backlog  drift  is  negative  for  sufficiently  large 
backlogs.  As  we  can  see  in  Fig.  5,  if  the  estimate  is  far  from  the 
true  v^ue,  then  the  backlog  may  actually  tend  to  increase. 

However,  at  every  point  in  the  state  space  the  tendency  of  the 
process  is  to  approach  the  diagonal  where  the  estimate  is  equal  to 
the  true  value  of  the  backlog.  Furthermore,  as  Fig.  5  or  the 
analysis  of  the  perfect-state  information  case  shows,  the  drift 
along  the  diagonal  is  negative.  Such  a  behavior  is  a  strong 
indication  of  the  stability  of  the  controlled  Markov  process. 

This  can  be  proved  using  a  powerful  sufficient  condition  found 
by  Mikhailov  [31]  for  the  stability  of  a  Markov  process  taking 
values  in  X  K+.  In  essence,  Mikhailov’s  condition  states  that 
it  is  enough  to  restrict  attention  to  those  points  of  the  state  space 
where  either  the  backlog  or  its  estimate  are  large  and  at  which  the 
drift  is  radial ,  i.e., 

d(n,  s)  n 
c(/»,  s)  s  ’ 

then,  it  is  sufficient  for  stability  that  the  drift  point  towards  the 
origin  at  those  states.  To  see  that  this  condition  is  indeed  satisfied 
for  our  system,  we  compute  first  the  asymptotic  drifts  along  the 
radius  {(/!,  s ):  n/s  =  for  0  €  [0,  00) 

d(^)  =  lim  d(\ps,  s)  =  \-<pe~*  (3.7a) 


(3.5) 

The  throughput  attainable  with  this  feedback  law  depends  on  the 
constants  a  <  0  and  >  0.  As  we  will  see,  there  exists  a  set  of 
choices  for  those  constants  that  results  in  throughput  equal  to  e~  ’. 

Unlike  the  case  of  complete-state  information,  the  proof  of 
stability  is  not  straightforward  because  now  it  is  the  two- 
dimensional  process  formed  by  the  backlog  and  its  estimate  {(/»*, 


c(i£)  =  lim  ctys,  s)=/S  +  (a-/3)e-*.  (3.7b) 

S-O* 

It  can  be  checked  using  (3.7)  that  if  Lhe  constants  a  and  (3  in  (3.5) 
are  chosen  such  that  18  >  0.23XandX  -  e~'  =  0  +  (a  -  /3)e~', 
then  the  drift  is  radial  only  at  ^  =  1  (cf.  Fig.  5),  where  it  points 
towards  the  origin  as  long  as  d(l)  =  X  -  e_l  <  0. 
Mikhailov’s  sufficient  condition  can  be  justified  constructing  a 
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stochastic  Lyapunov  function  to  prove  the  stability  of  a  Markov 
process  {at*}*  with  state  space  (J2+  X  lfl+.  To  that  end,  it  is 
advantageous  to  switch  to  polar  coordinates  (r,  <t>)  and  to  define 
the  radial  drift  5(r,  <f>)  as  the  projection  of  the  drift  along  the 
direction  of  the  point  (r,  <f>)  and  the  tangential  drift  n(r,  4>)  as  the 
projection  of  the  drift  along  the  direction  perpendicular  to  ( r ,  t/>). 
Denote  the  asymptotic  drifts  S($)  =  lim,..„  8(r,  <f>)  and  n(<t>)  = 
lim,..*  n(r,  4>)  and  define  the  function 

V{r, 

where 


B.  Simultaneous  Transmission 

In  contrast  to  random-access  communication  systems,  in 
simultaneous  transmission  multiple-access  systems,  the  transmit¬ 
ters  send  their  messages  simultaneously,  independently,  and 
without  monitoring  the  channel  in  any  way.  The  most  common 
type  of  simultaneous  transmission  system  is  code-division  multi¬ 
plexing,  where  each  user  modulates  a  preassigned  signature 
waveform  known  by  the  receiver. 

Specifically,  we  will  assume  that  in  order  to  send  the  message 
{bk(i)  €  A  (i.e.,  a  string  of  M symbols  drawn  from  a  finite 
set  A),  the  kth  user  transmits 


#(<£)  =  exp 


(-(?£*•)*) 


M—  I 

£ 


1-0 


bk(i)sk(t~iT) 


Note  that  V(r,  4>)  is  a  candidate  Lyapunov  function  because  it  is 
positive  outside  the  origin  and  V{r,  <f>)  -»  oo  as  r  -*  oo. 
Furthermore,  it  can  be  shown  [31]  that  the  asymptotic  drift  of  the 
candidate  Lyapunov  function  is  equal  to 

lim  ElV(xk^)-V(xk)\xk  =  (r,  «)]  =  *(^)f«(0) -<V(d)].  (3.8) 

r— oo 

Now,  under  Mikhailov’s  condition,  the  asymptotic  drifts  are 
assumed  continuous  on  [0,  x/2]  and  8{<f>)  <  (  for  any  phase  such 
that  n(4>)  =  0  (i.e.,  whenever  the  drift  is  radial  it  points  towards 
the  origin),  therefore,  the  constant  C  can  be  chosen  large  enough 
so  that  the  left  side  of  (3.8)  is  upper  bounded  by  a  negative 
constant.  This  implies  that  V(r,  <f>)  is  indeed  a  stochastic 
Lyapunov  function  and  therefore  standard  results  on  the  stability 
of  stochastic  systems  [27],  [45]  can  be  applied  to  show  the 
stability  of  the  system.4 

In  some  multiaccess  environments,  the  receiver  can  indeed 
demodulate  reliably  one  or  more  pa«.kets  even  in  the  presence  of 
other  interfering  packets  and  the  collision  channel  model  no 
longer  applies  to  those  cases.  The  results  reviewed  in  tins  section 
can  be  generalized  to  a  general  channel  with  multipacktt 
reception  capability,  to  show  that:  1)  the  throughput  of  open-loop 
ALOHA  rs  equal  to  the  lnntf  of  the  expected  number  of 
successfully  received  packets  per  slot  as  the  backlog  goes  to 
infinity  (171;  and  2)  the  throughput  of  closed-loop  ALOHA  (with 
either  complete  or  partial  state  information)  is  equal  to  the 
maximum  over  v  of  the  expected  number  of  successfully  received 
packets  per  slot  when  the  number  of  attempted  transmissions  is  a 
Poisson  random  variable  with  mean  u  [18]. 

Returning  to  the  case  of  the  collision  channel,  the  nest  natural 
step  is  to  drop  the  main  restriction  in  the  ALOHA  algorithm, 
namely,  that  all  stations  use  the  same  retransmission  probability. 
This  is  done  in  a  class  of  random-access  algorithms  referred  to  as 
collision  resolution  algorithms  which  are  characterized  by  the  fact 
that  not  only  are  all  blocked  packets  eventually  retransmitted 
successfully,  but  all  users  eventually  become  aware  that  these 
packets  have  been  successfully  retransmitted.  Contrary  to  the 
ALOHA  algorithm,  the  decision  whether  or  not  to  transmit  a 
packet  takes  into  account  the  previous  history  of  attempted 
retransmissions  of  that  particular  packet.  The  introduction  of  this 
new  dimension  into  the  problem  renders  Markov  chain  tools 
considerably  less  useful  than  in  the  foregoing  analysis  and 
converts  it  into  a  very  difficult  decentralized  stochastic  control 
problem,  for  which  the  optimum  throughput  remains  unknown5 
despite  many  efforts. 


*  Another  choice  of  stochastic  Lyapunov  function  for  the  specific  case  of 
decentralized  control  of  ALOHA  can  be  found  in  [43]. 

3  The  best  known  algorithm  has  been  shown  to  achieve  a  throughput  of 
0.488  using  Howard’s  policy  iteration  for  sequential  infinite-horizon  problems 
[32]  or  by  reduction  to  a  simple  optimization  problem  [48],  On  the  other  hand, 
it  is  known  that  the  optimum  throughput  is  upper  bounded  by  0.568  [44]. 


where  {s*(/),  0  ^  f  :S  7*}  is  the  waveform  assigned  to  the  Arth 
user,  and  T  is  the  symbol  period.  Then  the  demodulator  receives 
the  sum  of  the  signals  transmitted  by  the  K  active  users  embedded 
in  noise 

r(t)=  £  S'  bk(i)S'(t-iT-rk)+n(t)  (3.8) 

*-l  1-0 

where  the  offsets  r*_  |  s  rk  E  [0,  T)  model  the  fact  that  the  users 
do  not  synchronize  their  transmissions.  Then  the  task  of  the 
receiver  is  to  recover  the  transmitted  information  strings 
{bk(.i)}y„ Following  [47]  we  will  show  how  to  obtain  an 
optimum  multiuser  demodulator  via  dynamic  programming.  First, 
denote  the  Afflf-vector 

</={</*+*= &*(/).  *-i,  •••,*,  /=0,  •••,  M—  1} 
and  the  multiuser  signal  in  (3.8) 

x  «-t  UK 

£  £*0>*(/-<T-r*)  =  2  diZ,(t)  (3.9) 

t-l  /-a  im  | 

where  =  •**(/  -  »T  -  rk). 

A  mwwrabh;  criterion  for  demodulating  the  information  carried 
■  d)  npon  observation  of  r(t)  is  to  select  the  A/A-vector  d 

that  best  explains  the  received  waveform  in  the  sense  of 
minimizing  the  energy  of  the  corresponding  noise  realization, 
Le., 

min  | S(/,rf)-r(/)|2.  (3.10) 


If  the  noise  n(t)  is  white  and  Gaussian,  then  this  criterion  results 
in  maximum  likelihood  decisions.  Equivalently,  the  objective  is  to 
find  the  vector  that  solves 

max  Q (d)  (3.11) 

dZA** 

where 

Q(d)= 2  [“  S(t,  d)r(t)dt-  ["  S2(t,  d)  dt.  (3.12) 

Since  the  maximization  in  (3. 1 1)  is  over  a  finite  set,  we  could  solve 
it  by  the  brute-force  method  of  evaluating  Q (d)  for  each  possible 
argument.  However,  it  is  possible  to  decompose  0(d)  in  a 
sequential  fashion  that  lends  itself  to  efficient  optimization.  From 
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Fig,  6.  Symbol  epochs  for  =  3  and  M  -  S. 

(3.9)  it  is  immediate  to  write  the  first  integral  in  (3.12) 
sequentially 


where 


{09  ‘Y‘l\ 

so,  d)r(t)dt=^djyj 
JZx 

yj=  (  Z;(/)r(/)  dt. 

*  —  o* 


(3.13) 


(3.14) 


where  g*(m)  =  R(k  +  K,  k  +  m).  Putting  together  (3.12), 
(3.13),  and  (3.16)  we  see  that  we  can  express  fl(d)  as  a  sum  of 
MK  terms,  each  of  which  depends  on  K  components  of  d  and  such 
that  consecutive  terms  depend  on  the  same  components  but  one. 
Specifically,  we  can  write 


MK 

fi(rf)=£  Mxj.dj) 

/- 1 

where 


M*.  “)  =  u[2yj+  u*v,(/>- 2xrg,(y)| 


(3.17) 


(3.18) 


and  Xj  is  the  state  of  a  shift-register  K  -  1  dimensional  system 

*/+,  =  IWl).  •",*,+  ,(*-!)]  =  (*, (2), 


x„  =  0.  (3.19) 


This  implies  that  the  objective  function  (3.12)  depends  on  r(t) 
only  through  the  quantities  {yj}^,  which  are  obtained  by 
correlating  r(t)  with  each  of  the  signature  waveforms  during  each 
symbol  epoch.  In  order  to  find  an  explicit  expression  for  the 
second  integral  on  the  right-hand  side  of  (3.12),  which  is  the 
energy  of  the  multiuser  signal,  we  will  denote 

RU,  /)=-  ("  Zj(t)z,(t)  dt.  (3.15) 

J  -00 

It  follows  immediately  from  the  definition  that  these  coefficients 
satisfy  the  following  properties. 

1)  R(k  +  IK,  k  +  iK)  =  J Jj*(/)  4  wk. 

2)  R(k  +  iK,n  +  iK)  m  R(k,  n)  for  all  /. 

3)  R(j,  l )  =  0  unless  | j  -  l\  <  K. 

The  first  property  indicates  that  each  of  the  diagonal  elements 
of  R(i,  j)  is  equal  to  the  energy  of  one  of  the  K  assigned 
waveforms.  The  second  and  third  properties  can  be  illustrated  by 
referring  to  Fig.  6  which  represents  the  symbol  epochs  of  three 
asynchronous  users  sending  strings  of  M  =  5  symbols.  Each 
symbol  period  in  Fig.  6  is  labeled  with  the  index  of  the 
corresponding  component  of  the  vector  d.  The  second  property 
indicates  that  the  cross-correlations  between  two  signals  depend 
only  on  their  relative  location  (e.g.,  R( 4,  6)  =  /?  ( 1 3 ,  15)  in  Fig. 
6)  and  the  third  property  states  that  each  symbol  only  interferes 
with  2K  -  2  symbols  of  the  other  users  [e.g.,  in  Fig.  6,  d9  = 
b3(2)  only  overlaps  with  dn  =  b[( 2),  d%  =  bfl),  di0  =  b\(3),  and 
dn  =  62(3)].  It  follows  from  these  properties  that  the  coefficients 
in  (3.15)  can  be  obtained  from  the  K  X  K  matrix  {f?(Ar, 
,  whose  diagonal  elements  correspond  to  the  energy  per 
symbol  of  each  user  and  whose  off-diagonal  elements  correspond 
to  the  cross-correlations  between  the  signature  waveforms  of  each 
pair  of  users.  Using  (3.15),  the  foregoing  properties,  and  letting 
<(j)  £  {1,  •  •  •,  K)  be  the  modulo-#  remainder  of  j  (i.e.,  for 
some  i,  j  =  x(j)  +  iK),  we  can  write 

MK  MK 

J  SHt,d)dt='2^dJdlRU,l) 

j- 1  /- 1 

mk  r  /- 1  1 

=  S  dj\  w«o)+2  2  dtRU,  I ) 
j- 1  L  lwj-K+l  J 

mk  r  K- I  1 

=  S  dJ  \  W‘U)  +  2  S  dj-ngi(MK~n) 
j- I  L  n- I  J 

(3.16) 


It  is  now  apparent  that  the  solution  to  (3.11)  entails  solving  a 
finite-horizon  deterministic  optimal  control  problem  with 
additive  costs  per  stage  for  the  linear  system  in  (3. 19),  and  with  a 
finite  admissible  control  set  A .  Therefore,  optimum  multiuser 
demodulation  is  equivalent  to  a  sho'test  path  problem  in  an  M- 
stage  layered  directed  graph,  where  at  each  stage  there  are  AK~' 
states.  This  optimization  problem  can  be  solved  by  dynamic 
programming  (e.g.,  [7])  in  backward  or  forward  fashion.  In 
practice,  it  is  necessary  to  demodulate  the  transmitted  symbols  in 
real-time,  and  since  M  is  usually  a  very  large  integer,  it  is  not 
feasible  to  wait  until  all  the  observables  {y}  }■'[*,  have  been 
obtained  before  starting  to  make  decisions.  Therefore,  a  subopti¬ 
mum  version  of  the  forward  dynamic  programming  algorithm  is 
adopted  in  practice  whereby  each  decision  is  based  on  the  paths 
corresponding  to  the  cost-to-arrive  function  computed  a  fixed 
number  of  steps  ahead.  This  real-time  version  of  forward  dynamic 
programming  is  known  in  communication  theory  as  the  Viterbi 
algorithm  [12],  and  was  originally  devised  (without  resorting  to 
the  dynamic  programming  framework)  for  decoding  convolu¬ 
tional  codes.  The  maximum-likelihood  criterion  used  in  (3.10)  is 
not  the  only  possible  optimality  criterion.  For  example,  if  the 
objective  is  to  minimize  the  probability  of  error  for  each  user, 
then  the  multiuser  demodulator  uses  a  brckward-forward 
dynamic  programming  algorithm  [49]  whereoy  optimum  deci¬ 
sions  are  based  on  the  independent  computation  of  a  cost-to-go 
and  a  cost-to-arrive  function. 


IV.  Other  Problem  Areas 

Routing  and  multiple  access  are  not  the  only  problem  areas  in 
the  field  of  communication  networks  which  control  theory  can 
help  formulate,  study,  and  solve.  We  have  deliberately  chosen  to 
confine  our  attention  to  these  two  areas  in  order  to  get  across  m  a 
concise  manner  our  belief  that  the  field  of  communication 
networks  offers  a  rich  selection  of  applications  for  control  theory. 
We  would  feel  remiss,  however,  if  we  did  not  even  make  an 
attempt  to  provide  a  taste  of  some  of  the  numerous  other  design 
and  operation  issues  that,  again,  bring  forth  control  systems 
concepts  and  techniques.  For  this  purpose,  and  with  a  conscious 
effect  not  to  expand  in  depth  but  only  to  describe,  we  will  mention, 
two  areas  from  point-to-point  networks  and  one  from  radio 
networks.  The  first  two  concern  flow  control  and  integrated 
switching,  respectively,  while  the  third  concerns  the  problem  of 
scheduling  transmission  in  multihop  networks,  inlike  the  cases  of 
routing  and  multiple  access,  these  areas  have  not  yet  fully 
benefitted  from  the  use  of  control  theoretic  approaches  although 
such  approaches- would  be  very  well  suited  to  them  indeed. 


940 

A.  Flow  Control 
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A  stark  reality  in  the  design  of  networks  is  that  despite  the 
reduction  of  the  cost  of  memory,  storage  at  each  node  is  going  to 
be  finite.  Coupled  with  another  reality,  namely  that  data  transmis¬ 
sions  on  the  whole  continue  to  be  bursty,  it  implies  that  buffer 
overflow  may  occur  and,  along  with  it,  congestion  and  deadlocks. 
Flow  control  is  the  name  we  use  to  describe  the  collection  of 
measures  taken  to  avoid  buffer  overflow  and  highly  congested 
nodes  in  the  network.  Congestion  and  saturation  are  often  the 
consequences  of  diverging,  unstable  behavior.  Thus,  it  is  of 
interest  not  only  to  optimize  over  possible  flow  control  strategies, 
but  to  determine  their  robustness  against  disturbances  or  modeling 
inaccuracies  that  may  lead  to  unstable  behavior. 

The  control  variables  in  flow  control  problems  are  admission 
(or  blocking)  probabilities  for  messages  or  sessions  at  the  source 
node.  In  practice  these  are  often  implemented  in  terms  of  a  bang- 
bang  control  strategy  known  as  window  flow  control  whereby 
input  ports  are  allowed  to  continuously  inject  messages  into  the 
network  at  the  full  desired  input  rate  until  the  number  of 
unacknowledged 6  messages  exceeds  the  value  of  the  “window 
size"  w.  A  simple,  yet  unanswered  question  is,  what  should  the 
value  of  tv  be? 

Previous  efforts  to  use  control  theory  tools  to  analyze  optimal 
flow  control  problems  include  [28]  and  [46]  where  the  optimality 
of  window  flow  control  is  proved  within  the  domain  of  a 
simplified  model,  and  [39]  where  dynamic  programming  value 
iteration  techniques  are  used  to  characterize  optimal  flow  control 
performance.  An  alternative  approach  to  the  flow  control  problem 
is  to  subsume  it  into  the  static  routing  problem  considered  in 
Section  II-A  [19]:  suppose  that  for  every  source-destination  pair  a 
fictitious  direct  link  is  added  between  them.  We  can  then  interpret 
the  blocking  action  of  a  flow  control  procedure  as  a  diversion  of 
the  blocked  portion  of  the  traffic  through  this  fictitious  link  to  the 
destination.  Thus,  we  can  consider  that  no  traffic  is  blocked.  Of 
course,  in  order  to  discourage  the  use  of  this  ,  rtitious  link  we 
must  augment  the  overall  delay  cost  function  with  a  term  that 
penalizes  appropriately  the  use  of  this  link. 
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Fig.  7.  Switching-type  optimum  policy  for  integrated  switching. 


therefore,  N  =  W/  Vbe  the  maximum  number  of  calls  that  can  be 
assigned  dedicated  circuits  simultaneously  if  no  data  packets  are 
transmitted.  A  voice  call  can  either  be  accepted  (and  assigned  the 
necessary  bandwidth  V)  or  blocked.  Data  packets  can  be  stored  in 
a  buffer  facility.  If,  at  a  given  time,  there  are  /  calls  in  the  system, 
the  data  packets  can  be  served  at  the  full  rate  corresponding  to  the 
remaining  bandwidth  W  -  iV.  Such  a  switching  architecture 
represents  what  has  been  called  the  movable  boundary  idea  in 
integration.  A  natural  MDP  can  be  simply  formulated  as  follows: 
choose  the  control  action  of  blocking  or  accepting  a  call  upon 
arrival  in  order  to  minimize  the  weighted  sum  of  the  average  data 
packet  delay  and  the  call-blocking  probability.  If  we  assume  that 
both  arrival  streams  (voice  calls  and  data)  are  independent  Poisson 
processes,  that  the  call  holding  time  is  exponentially  distributed, 
and  that  the  message  lengths  are  likewise  exponential,  we  can 
apply  the  technique  described  in  Section  II  of  converting  the  MDP 
to  an  LP  and  show  that  the  optimal  policy  has  the  useful 
switching-type  form.  Namely,  if  i  is  the  number  of  ongoing  calls 
and  j  the  total  number  of  data  messages  at  the  node,  the  optimal 
control  action  should  be  to  block  the  call  in  region  B  of  the  state 
space  as  shown  in  Fig.  7  and  to  accept  it  in  region  A. 

C.  Link  Scheduling 


B.  Integrated  Switching 

A  revolutionary  development  in  the  field  of  networks  whose 
implementation  is  currendy  under  way  is  the  combination  of  the 
capabilities  of  what  have  been  separately  developed  in  the  past  and 
called  voice  networks  and  data  networks.  Voice  is  a  commodity 
that  must  meet  different  requirements  than  data.  For  example, 
speech  signals  have  inherent  redundancy  that  make  them  quite 
robust  with  respect  to  occasional  errors  or  deliberate  compres¬ 
sion.  At  the  same  time,  except  in  applications  of  voice  messaging, 
speech  signals  occur  in  the  context  of  real-time  conversations  and, 
as  such,  must  encounter  short  and,  more  importantly,  constant 
delay.  On  the  other  hand,  data  must  preserve  their  integrity  and 
cannot  tolerate  errors,  however,  long  and  variable  delays  can  be 
often  tolerated. 

How  does  one  design  a  single  network  that  can  handle  such 
dissimilar  commodities  with  automated  procedures?  The  natural 
course  of  events  in  the  last  decade  or  two  was  to  attempt  to  force 
data  on  primarily  voice  networks  or  to  let  voice  ride  on  what  were 
mainly  data  networks.  The  literature  is  full  of  ideas  for  baseline 
integration  that  ate  mostly  heuristic  and  difficult  to  analyze.  An 
attempt  to  formulate  the  problem  of  integrated  switching  as  an 
optimization  problem  was  presented  in  [50] .  In  it  simplest  form 
the  model  is  as  follows:  consider  a  single  node  in  the  network  with 
a  single  outgoing  link  on  which  incoming  voice  calls  and  data 
packets  must  be  multiplexed.  Let  W  be  the  bandwidth  of  the 
outgoing  link.  Let  V  be  the  bandwidth  required  for  the  continu¬ 
ous,  uninterrupted  accommodation  of  a  single  voice  call.  Let, 


Let  us  now  turn  our  attention  back  to  the  radio  network 
environment.  In  Section  in  the  multiple  access  channel  was 
considered  and  a  number  of  difficult  but  interesting  control 
problems  were  identified.  Throughout  that  discussion,  it  was 
assumed  that  all  terminals  are  within  a  single  transmission  hop 
from  the  destination.  In  many  radio  networks,  however,  this  is  not 
the  case.  Messages  need  to  be  relayed  via  intermediate  nodes  to 
their  final  destinations.  Thus,  the  familiar  problem  of  routing 
arises  again,  except  that  this  time  there  is  a  new  twist  to  it.  In 
point-to-point  networks,  transmissions  between  different  node 
pairs  can  take  place  simultaneously  because  there  are  dedicated, 
“hard-wired”  links  between  the  corresponding  nodes.  In  a  radio 
(or,  more  generally,  in  a  multiaccess/broadcast)  environment,  if 
the  nodes  are  densely  connected,  not  all  transmissions  can  take 
place  simultaneously  (unless  separate  dedicated  channels  or 
simultaneous  transmission  signaling  techniques  (Section  HI-B)  are 
used).  They  must  be  scheduled  in  time  to  avoid  the  interference 
that  would  occur  otherwise. 

It  becomes  evident  that  the  mere  fact  that  the  transmission 
among  a  group  of  nodes  must  take  place  one  at  a  time  raises  the 
question  whether  the  intended  transmissions  are  routing-wise 
optimal  any  more.  Several  versions  of  this  problem  have  been 
studied  [3],  [23],  [36].  In  every  case  and  even  if  the  routing 
problem  is  sidestepped,  we  are  led  to  hard  combinatorial 
optimization,  problems  where  questions  of  computational  com¬ 
plexity  and  distributed  implementation  are  of  primary  importance. 

V.  Conclusion 


4  Note  the  implicit  assumption  of  delayed  feedback  information  from  the  It  should  be  clear  by  now  that  the  theory  of  linear  and  nonlinear 
destination  to  the  source  node.  optimization,  dynamic  programming,  stochastic  control,  stability 
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analysis,  and  distributed  control  have  found  interesting  applica¬ 
tions  arising  in  the  analysis  and  design  of  communication 
networks.  Unlike  other  complex  systems  that  have  been  success¬ 
fully  studied  by  control  sys'em  theorists  in  the  past  (such  as 
|  chemical  plants,  flexible  aircraft,  robot  systems,  etc.),  communi- 
<  cation  networks  stand  out  in  that  the  commodity  to  be  controlled  is 
information  (including  its  transmission,  storage,  processing,  etc.). 
This  feature,  perhaps,  misleads  and  intimidates  those  who  do  not 
feel  sufficiently  inter-disciplinarian  to  tackle  these  problems.  We 
hope  that  by  having  selected  to  present  a  few  examples  in  which 
concrete,  purely  control-theoretic  problems  can  be  formulated  and 

•  have  been  (or  can  be)  studied  successfully,  we  may  encourage 
|  attention  by  the  control  community  to  this  application  area  that  is 

*  especially  rich  in  new  challenges. 

As  stated  from  the  outset,  we  did  not  attempt  to  survey  or 
completely  cover  the  multiple  control  facets  of  communication 
networks.  The  collection  in  this  paper  merely  represents  an  effort 
to  illuminate  a  few  selected  problem  areas  and  to  show  how 
control  techniques  apply  to  them. 
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Abstract— A  decentralized  control  algorithm  Is  sought  that  maximizes 
the  stability  region  of  the  infinite-user  slotted  multipacket  channel  and  is 
easily  implemcntable.  To  this  end,  the  perfect  state  information  case 
where  the  stations  can  use  the  instantaneous  value  of  the  backlog  to 
compute  the  retransmission  probability  is  studied  first.  The  best  through¬ 
put  possible  for  a  decentralized  control  protocol  is  obtained,  as  well  as  an 
algorithm  that  achieves  it.  Those  results  are  then  applied  to  derive  a 
control  scheme  when  the  backlog  is  unknown,  which  is  (he  case  of 
practical  relevance.  This  scheme,  based  on  a  binary  feedback,  is  shown  to 
be  optimal  given  some  restrictions  on  the  channel  multipacket  reception 
capability. 


I.  Introduction 

MOST  studies  on  random  access  communications  rely  on  the 
assumption  that  when  two  or  more  packets  overlap,  all  the 
information  that  was  sent  is  irremediably  lost,  hence  the  need  to 
repeat  all  transmissions  at  some  later  time.  This  is  actually  a 
pessimistic  point  of  view,  since  there  are  many  examples  of 
random  access  systems  where  one  or  more  packets  may  be 
successful  m  the  presence  of  other  simultaneous  transmissions.  In 
order  to  represent  such  random  access  systems,  a  model  for  a 
channel  with  multipacket  reception  capability  has  been  developed 
in  [6]-[8].  We  consider  a  slotted  channel  with  an  infinite 
population  of  users,  and  we  assume  that  the  probability  of  having 
k  successes  in  a  slot  where  there  are  n  transmissions  depends  only 
on  the  collision  size  n 

ink~P[k  packets  are  correctly  received  |u  are  transmitted] 

(na  1,  0 sk£n). 

We  define  the  reception  matrix  as 


This  model  can  be  applied  to  channels  with  capture  (1]-[3J,  [10], 
[16],  [18],  [20],  [23],  [26],  [28],  [34]  and  to  systems  using 
CDMA  [22],  [24],  [29].  It  is  also  relevant  for  many  other 
applications,  such  as  systems  with  multiuser  detectors  [33]  or,  for 
instance,  the  channel  studied  in  [17],  [31].  For  more  details  about 
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this  model,  the  reader  is  referred  .o  [6]  and  [8],  Denoting  by  C„  = 
££„,  kenk  the  average  number  of  packets  correctly  received  in 
collisions  of  size  n,  we  assume  that  the  limit  C  =  lim„_«  C„ 
exists,  as  is  usually  the  case  with  models  of  practical  interest.  It 
has  been  proved  in  [8]  that  the  Aloha  random  access  algorithm  has 
a  maximum  stable  throughput  7j0  =  C  in  the  multipacket  channel. 

Decentralized  control  strategies  have  been  shown  [11],  [12], 
[19],  [25],  [30]  to  stabilize  the  slotted  Aloha  algorithm  in  the  case 
of  the  usual  collision  channel,  hence,  it  is  reasonable  to  expect  that 
when  those  strategies  are  used  in  the  multipacket  channel,  the 
resulting  throughput  will  be  higher  than  tj0.  We  consider  schemes 
of  the  form 

Pn  =  F(Sn) 

Sn+l  =  G(Sn,  Zn)  (1) 

where  pn  is  the  retransmission  probability  in  slot  n,  S„  is  an 
estimate  of  the  backlog  X„  at  the  beginning  of  slot  n,  and  Z„  is  the 
feedback  at  the  end  of  slot  n.  The  number  of  new  packets  arriving 
during  slot  n,  An  is  assumd  to  form  a  sequence  of  i.i.d.  random 
variables  with  probability  distribution  P[A„  ~  k]  -  A k(k  a  0), 
such  that  the  mean  arrival  rate  A  =  ,  n\„  is  finite.  Each  of  the 

An _  |  new  packets  that  arrived  during  slot  n  -  1  is  transmitted  in 
slot  n  with  probability  pn. 

As  m  the  case  of  conventional  channels,  it  is  useful  to  study  first 
the  case  of  control  with  perfect  state  information  where  the  value 
of  the  backlog  is  given  to  the  users  prior  to  the  selection  of  the 
retransmission  probability.  To  keep  track  of  the  exact  value  of  the 
backlog,  a  central  controller  is  usually  necessary,  which  is  an 
unreasonable  requirement  for  most  practical  random  access 
channels.  However,  the  study  of  the  perfect  state  information  case 
allows  us  to  determine  an  upper  bound  to  the  best  throughput 
achievable  by  any  decentralized  control  of  the  form  (1),  and 
suggests  a  simple  implementation.  Those  results  are  in  turn  helpful 
to  derive  control  protocols  in  the  case  where  the  backlog  is 
unknown.  This  is  done  in  Section  III  where  we  consider  a  backlog 
estimate  which  is  recursively  updated  using  the  binary  feedback 
empty /nonempty.  In  addition,  it  is  assumed  throughout  the  paper 
that  each  station  is  informed  when  its  packet  is  successfully 
received.  It  is  proved  that  provided  a  certain  condition  on  the 
reception  matrix  holds,  the  throughput  achievable  with  this  type  of 
feedback  is  the  same  as  the  perfect  state  information  throughput 
This  condition  is  verified  for  most  multipoint-to-point  channels  of 
practical  interest. 

In  a  paper  whose  translation  appeared  only  very  recently  [19] 
(after  our  work  [7]),  Mikhailov  has  derived  sufficient  conditions 
for  stability  and  instability  of  two-dimensional  Markov  chains. 
Although  this  was  meant  to  be  used  for  decentralized  control 
schemes  in  the  usual  collision  channel,  this  approach  is  powerful 
enough  to  be  applied  to  the  multipacket  channel.  In  Section  IV  we 
show  by  using  Mikhailov’s  result  that  the  scheme  presented  in 
Section  III  is  stable  under  weaker  assumptions.  However,  only  a 
weaker  form  of  stability  can  be  proved  in  this  way. 
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II.  Control  of  the  Multipacket  Channel  with  Perfect 
State  Information 

In  this  section  we  assume  that  all  the  users  know  the  value  of  X„ 
at  the  beginning  of  slot  n,  and  we  let  the  retransmission 
probability  be  a  function  of  the  exact  value  of  the  backlog,  i.e.,  pn 
-  F(Xn).  In  this  ideal  case,  the  system  is  much  simpler  to 
analyze  than  in  the  general  case  (1)  since  (Xn)n* o  is  a  homogene¬ 
ous  Markov  chain.  Our  goal  is  to  determine  the  optimal  control 
function  F*  that  yields  the  largest  ergodicity  region,  and  die 
corresponding  throughput,  denoted  by  ijc.  For  instance,  it  is  well 
known  [4]  that  for  the  usual  collision  channel  with  the  access  rule 
in  effect  here,  F*(X„)  =  l/X„  is  the  retransmission  probability 
that  minimizes  the  drift  at  each  step,  resulting  in  an  ideal 
throughput  of  jjc  =  e~l. 

First  note  that  all  the  results  herein  are  valid  provided  that  the 
backlog  Markov  chain  ( X„ ,  S„)n£o  corresponding  to  a  control  (1) 
is  irreducible  and  aperiodic.  It  can  be  easily  checked  that  for  both 
access  rules  considered  in  this  paper  (see  below),  as  well  as  all  the 
algorithms,  a  simple  set  of  sufficient  conditions  for  irreducibility 
and  aperiodicity  is 

a)  Xo=£0 

00 

b)  \>+  X/je„/i<l 


which  becomes  dn(p)  =  X  -  /„(/?)  if  we  define  t„(p)  to  be  the 
average  number  of  successes  given  the  backlog  n  and  the 
retransmission  probability  p 

'«(/»  =  2  (”)py(1-P)'"yQ.  (5) 

Since  t„(p)  is  a  polynomial  on  the  compact  [0,1],  it  achieves  its 
maximum  and  we  can  define 

p*  =  arg  max  /„(/>)  =  arg  min  d„(p). 

p6[0.l|  pS[0,H 

We  now  proceed  to  compute  the  limit  of  the  drift  when  the 
retransmission  probability  p*  is  used.  We  show  that 

00  fj 

lim  t,(pj)  =  sup  e~x  V  C„  ^-=sup  t(x).  (6) 

xaO  ",  n!  jrao 

Let  us  first  assume  that  C  <  +  co. 

Property  1: 

lim  t(x)  =  C. 

X-*« 

We  have  for  n  >  M 


c)  eio^O 

-which  are  analogous  to  the  conditions  for  the  open-loop  system 
fstudied  in  [6].  The  theorem  below  gives  the  best  throughput 
f  possible  for  a  control  protocol  (1). 

Theorem  1:  There  exists  a  retransmission  probability  p*  that 
minimizes  the  expected  backlog  increase  when  the  backlog  is 
equal  to  n. 

With  such  a  retransmission  probability,  the  system  is  stable  for 
X  <  i?f  and  unstable  for  X  >  i)e,  with 

!  “  x" 

*  7k=sup  e~x  J  C„-. 

jraO  ,  n! 


Proof  of  Theorem  1:  The  proof  is  based  on  standard  drift 
analysis  techniques.  (Xf),^  is  a  homogeneous  Markov  chain 
which  evolves  according  to 

j|  X,+  i=Xt+A,— 2(  (2) 

where  E,  is  the  number  of  packets  successfully  transmitted  in  slot 
t.  The  system  is  defined  to  be  stable  if  (X,),^  is  ergodic  and 
unstable  otherwise.  Let  dn  be  the  drift  of  X,  at  state  n:  dn  = 
E[X,¥  i  -  X,\X,  =  /»].  We  have  0  <  E,  £  X„  and  if  we  denote 
by  p  the  retransmission  probability  used  in  slot  /,  then  for/i  >  1, 
»|e  probability  of  having  k  successes  is  given  by 

P[l,=k\Xt=n]  =  'Z^pi{\-py-Jtij  (1  sk*n).  (3) 


ft  then  follows  from  (2)  that  the  backlog  drift  at  state 
liven  by 


Jtml  j.lt  '  J 


n  >  1  is 


\t{x)-C\se-xC+e~x  §  ^|C„-C|+  £  ~|Cn-C|. 


n»M+  I 


(7) 

Pick  e  >  0  and  fix  M such  that  |C„  -  C|  <  e  for  n  >  M.  Then  if 
Bc  is  an  upper  bound  on  the  sequence  (Cn)n!t\,  (7)  yields 

m  xn 

\t(x)-C\se-xC+2Bce-x  £  — +<= 

n»  I 

and  the  right-hand  side  of  this  last  equation  goes  to  zero  as  x  goes 
to  infinity. 

Property  2:  For  all  e  >  0,  there  exists  A  >  0  such  that  for  all 
np  >  A,  | t„(p)  -  C|  <  e.  We  have 

| t*(p)~  cl  *  2  (  J  )  p'O -p)"’1  I  Ci~  Cl  +  (1  ~P)nC. 
Choosing  M  as"  for  Property  1  we  get 


|<.(p)-C|s2*eS  (j^pi(l-py-J+t. 

Let  us  denote  by  R„  the  random  variable  corresponding  to  the 
number  of  retransmissions  in  a  slot  given  that  the  backlog  is  equal 
to  n.  We  have 


s  pH}-py-‘=P[R^M]^P 


for  np  >  2 M.  Then  from  the  Chebyshev  inequality 


P[Rn-£M\Z 


4_ 

np 


(8) 


/>'(!  ~PY-}Cj 


and  Property  2  follows. 

(4)  Property  3:  t„(x/n)  converges  uniformly  to  t(x)  on  any 
compact  (0 ,  A]. 
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Fix  e  >  0  and  choose  A/ such  that  H?mUAJC/jl  <  e.  Then  for 
n  >  M  +  1  and  .v  €  [0,  A] 


M 


/■i 


n(n-  1)  (n-j+  1) 


n-y 


+  2e. 


Since  lim„-„  n(n  -  1)  •  •  •  (n  -  j  +  1  )/ni  -  1  for  1  <  j  <,  M, 
it  is  enough  to  show  that  (1  -  x/n)n~J  converges  uniformly  to  e~x 
for  1  <  y  <  M.  We  have 


n-J 

-e 


~x£e~x[e*},n—  1]  <eAM/n  —  1. 


On  the  other  hand,  for  n  >  A, 


(9) 


and  uniform  convergence  follows  from  (9)  and  (10). 

Property  4:  t„(x/ri)  converges  uniformly  to  t(x )  for  xsO. 
Fix  e  >  0.  From  Properties  i  and  2  we  can  fix  A  such  that: 

i)  for  all  np  >  A,  | t„(p)  -  C|  <  e, 

ii)  for  all  x  >  A,  |f(x)  -  C|  <  e. 

Then  we  distinguish  two  cases.  If  x  £  [0,  A),  then  from 
Property  3  there  exists  N  such  that  for  all  n  >  N,  \t„(x/n)  - 
f(x)|  <  £.  If  on  the  other  hand  x  £  (A,  +  oo),  we  have 


£ 


tn 


+  |?(x)-C|^2£ 


(11) 


from  i)  and  ii). 

Thus,  we  have  shown  that  when  C  is  finite,  tn{x/n)  converges 
uniformly  to  t(x)  forx  2:  0.  It  follows  that  lim„-®  supxa0  t„(x/rt) 
=  supxao  t(x)  and  so  (6)  is  proved. 

Finally,  we  show  that  (6)  hoius  when  C  =  +  oo.  Choose  A 
arbitrarily  large  and  M  such  that  C„  >  A  for  n  >  M.  Then  for  n 
>  M 


tn 


n~J*A(\  ~P[RnsM])- 


From  (8)  P[Rn  <  M]  is  arbitrarily  small  for  nx/n  =  x  large 
enough.  Therefore,  supxa0  tnfx/ri)  =  +  «  and  lim„-*  tn(p*)  = 
+  <».  Since  it  is  clear  that  if  C  =  +  <»,  then  sup^o  t(x)  =  +  oo, 
(6)  holds. 

From  the  equality  limn-*  d„(p*)  =  X  -  supxi0  t(x)  and  Pakes 
Lemma  in  [21],  it  follows  that  if  limn-®  C„  =  +  oo,  then  limn-® 
dniPt)~  ~  °°-  and  1116  system  is  always  stable,  whereas  if  lim„-„ 
C„  <  +oo,  then  (A'n)nao  is  ergodic  for  X  <  rjc  =  supxa0  t{x). 
Also,  it  is  shown  in  the  Appendix  that  Kaplan’s  condition  holds 
for  this  system  when  the  sequence  (Cn)nai  is  bounded,  thus  from 
Kaplan’s  result  [13],  the  backlog  Markov  chain  is  nonergodic 
when  X  >  jjc.  □ 

It  is  intuitively  obvious  that  no  decentralized  control  algorithm 
of  the  form  (1)  can  have  a  maximum  stable  throughput  larger  than 
»je.  The  theorem  below  gives  a  rigorous  proof  of  this  fact  and  also 
shows  that  this  throughput  can  be  achieved  with  a  control  which  is 
much  simpler  than  p*. 

Theorem  2:  The  best  throughput  achievable  by  a  decentralized 
control  algorithm  (1)  is  t\c  =  supx:soe"Jf2"ii|x',//i!  Cn.  Ifi?c  >  C 
=  lim„-®  C„,  then  there  exists  a  constant  A  >  0  such  that  the 
control  p,  =  A/X,  for  X,  >  A  yields  the  optimal  throughput  j?c. 


Proof  of  Theorem  2:  To  prove  the  first  part  of  the  theorem 
we  use  a  result  of  [27]  which  is  a  generalization  of  Kaplan’s 
Theorem.  If  p,  =  F(S,)  and  S,+  i  =  G(St,  Z,)t  consider  the 
Markov  chain  (X,,  S ,)  and  the  Lyapunov  function  V(n,  s)  =  n. 
Assume  that  X  >  ijc-  Then 


E[V(X,+  U  S,+  1)-  V(X„  S,)\X,=n,  S,=s) 

=x-2  -nsrr-'c, 

zdAe;)*^  <12> 

for  all  n  large  enough  and  all  s.  Therefore,  the  drift  of  V  is  strictly 
positive  outside  a  finite  subset  of  the  state  space.  Since  it  is  shown 
in  the  Appendix  that  the  generalized  Kaplan’s  condition  is 
verified,  it  is  enough  to  conclude  that  (X,,  5, )  is  nonergodic. 
Hence,  t)c  is  indeed  the  best  throughput  achievable  by  any 
decentralized  control  algorithm  of  the  form  (1). 

To  prove  the  second  part  of  the  theorem,  we  need  the  following 
property. 

Property  5:  If  for  all  x  >  0,  r(x)  <  supx£o  /(*),  then  supxi0 
/(x)  =  C. 

If  supxa0  t{x)  =  +  oo,  it  is  easily  seen  that  C  =  +  oo.  If  supxa0 
f(x)  <  +os,  then  C  <  +oo.  Consider  a  sequence  (xn)nal  of 
nonnegative  reals  such  that  lim„-„  f(x„)  =  supx20  t(x).  If  (xn)nal 
was  bounded  above  by  K  <  +  oo,  we  would  have  for  all  n  >  1, 
t(xn)  s  supx6(0.^  r(x),  and  in  the  limit  supxa0  t(x)  =  supx610,jq 
/(x).  Then  there  would  exist  x0  £  [0,  K\  such  that  f(x0)  =  supxa0 
/(x),  which  is  a  contradiction.  Therefore,  (x„)n2i  is  unbounded, 
and  one  can  build  a  subsequence  (x„*;)*j.i  such  that  lim*-*  x„k  = 

+  co.  We  still  have,  of  course,  lim*-®  t(x„k)  =  supx!:o  t(x),  but 
on  the  other  hand,  we  have  lim*-*  t(x„k)  =  limx-®  /(x).  From 
Property  1  in  the  proof  of  Theorem  1,  limt_„  t(x)  =  C  and 
Property  5  follows. 

Thus,  if  ijc  >  C,  then  /(x)  achieves  its  supremum  at  some  finite 
positive  real  A.  Let  us  consider  the  control  p,  =  A/X,  for  X,  > 
A.  (Note  that  the  value  of  the  retransmission  probability  is  left 
unspecified  for  X,  <  A  because  it  does  not  affect  the  throughput.; 
Then  from  (4)  dn  =  X  -  t„(A/n),  and  from  Property  3  m  the 
proof  of  Theorem  1  lim„-®  d„  =  X  -  t{A).  Then  it  follows  from 
[21]  that  (X,),>0  is  ergodic  if  X  <  t(A)  and  from  [13]  and  the 
Appendix  that  (Af,),ao  is  nonergodic  if  X  >  t(A).  Thus,  the 
maximum  stable  throughput  of  the  system  is  t(A)  =  supxa0  t(x) 
=  rjc.  □ 

Note  that  the  closed-loop  throughput  obtained  in  Theorems  1 
and  2  can  be  interpreted  as  i?c  =  supAr_P(xM>o£[C^],  that  is  as  the 
supremum  over  x  of  the  expected  value  of  CN  if  TV  is  a  Poisson 
distributed  random  variable  with  mean  x.  Note  that  if  we  were  to 
follow  the  popular  approximation  [1],  [2],  [10],  [16],  [18],  [24], 
[26]  that  assumes  that  the  number  of  transmissions  in  each  slot,  N, 
is  Poisson  distributed,  and  if  we  could  choose  any  positive  number 
as  the  mean  of  N  by  regulating  the  retransmission  probability,  the 
throughput  would  be  equal  to  the  average  number  of  successes  per 
slot,  E[Cs],  maximized  over  the  mean  of  N.  As  in  the  usual 
collision  channel,  a  wrong  analysis  leads  to  a  correct  conclusion. 
Several  examples  are  gathered  in  Table  I  (see  [8]  for  details). 

Probably  the  most  important  conclusion  of  this  section  is  that  in 
general  it  is  not  necessary  to  compute  the  exact  value  of  p*,  which 
would  require  a  large  amount  of  on-line  computations,  and 
seriously  hinder  any  application  of  Theorem  1  to  the  case  where 
the  backlog  is  unknown.  Two  cases  may  occur.  If  t(x)  does  not 
attain  its  supremum,  from  Property  5  in  the  proof  of  Theorem  2, 
we  have  rje  =  ij0  =  C  (e.g.,  this  happens  in  the  model  developed 
in  [6]  for  mobile  users  with  pairwise  transmissions).  In  this  case 
no  throughput  improvement  can  be  achieved  by  varying  the 
retransmission  probability,  and  therefore  it  is  enough  to  restrict 
attention  to  the  open-loop  strategy  studied  in  [8].  On  the  other 
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TABLE  I 

open-loop  and  closed-loop  throughputs  for  several 

MULTIPACKET  CHANNELS 


noise  0  <  C,  ^  1  and  C„  =  0  for  n  >  2.  Then  T(x)  =  C,  e-*(X, 
+  \oX)  and  T'(x)  =  Cte~x(\1  -  X,  -  X<a).  Therefore,  for  any 
distribution  such  that  Xo  <  X|,  T(x)  is  maximum  at  T(Q)  +  C|X|, 
and  the  stability  region  is  empty  since  QX,  <  Xj  <  X.  Note  that 
in  this  sense,  the  immediate  first  transmission  does  not  perform  as 
well  as  the  delayed  first  transmission  with  which  the  system  can 
always  be  stabilized. 

If  there  are  solutions  to  X  <  sup^o  T(x),  then  the  best 
throughput  achievable  by  the  class  of  algorithms  in  (1)  is  =  sup 
{X:X  <  sup,20  T(x)}.  This  is  what  happens,  for  instance,  when 
the  new  packet  arrivals  are  Poisson  distributed. 

Theorem  3:  If  the  new  packet  arrivals  are  Poisson  distributed, 
the  best  throughput  achievable  with  an  IFT  rule  is  the  same  as  in 
the  DFT  case,  vc  =  supxi0  /(*). 

Proof  of  Theorem  3:  If  lini„-„  Cn  =  +  oo,  then  tjc  ~  vc  = 
+  oo.  Assume  now  that  C  <  +  oo.  We  get 


»  Yn  »  X* 


r(x)  =  e-(,+x>  £  i.  2  2  cn+* 


n»0  k-0  k! 


! 

! 

! 

i 

t 


i 


hand,  if  there  exists  A,  0  <  A  <  +c°,  such  that  t(A)  =  supxa0 
/(*),  then  we  have  shown  in  the  proof  of  Theorem  2  that  the 
control  p,  =  A/X,  for  X,  >  A  yields  a  maximum  stable 
throughput  ((A)  =  ijf,  meaning  that  the  system  is  optimal.  Hence, 
only  A  has  to  be  computed,  and  this  can  be  done  before  starting 
the  operation  of  the  system. 

Although  in  most  practical  applications  (C„)„  2:  1  does  have  a 
limit,  it  is  worth  noticing  that  Theorem  1  can  be  generalized  to  the 
case  where  C  does  not  exist.  It  can  be  shown  [9]  that  if  the  drift  is 
minimized  at  each  step,  then  the  system  is  stable  for  X  <  sup,^ 
t(x)  and  unstable  for  X  >  sup^o  f(x)  +  lim„-„  sup  C„  -  lim„-„ 
inf  C„.  As  in  the  open-loop  system  when  (C„)n5:  (  does  not  have  a 
limit,  nothing  more  can  be  said  about  the  throughput  without 
further  information  on  the  sequence  (£?„)„*  (.  But  the  main 
drawback  in  such  a  case  is  that  there  may  not  exist  any  control  p„ 
=A/Xn  that  yields  the  optimal  throughput. 

The  access  rule  for  new  packets  that  we  have  been  considering 
so  far  is  usually  referred  to  as  delayed  first  transmission  (DFT;. 
With  this  access  rule,  newly  arrived  packets  are  treated  exactly  in 
the  same  way  as  backlogged  packets.  Let  us  now  examine  what 
happens  when  on  the  contrary  an  immediate  first  transmission 
(IFT)  rule  is  used,  that  is  when  new  packets  are  transmitted  with 
probability  one  in  the  slot  immediately  following  their  arrival.  It 
has  been  proved  in  [8]  that  the  open-loop  throughput  .is  the  same 
for  both  first  transmission  rules.  The  closed-loop  throughput  on 
the  other  hand  depends  on  the  access  rule.  For  instance,  it  is  well 
known  [4]  that  for  the  usual  collision  channel  in  the  IFT  case,  the 
optimal  retransmission  probability  is  p*  =  X<>  -  Xj/Xo n  —  Xi, 
yielding  an  optimal  throughput  XoeVV-1,  in  contrast  to  the 
throughput  i)c  =  e~l  for  the  DFT  case.  In  the  multipacket  channel 
with  the  IFT  rule,  the  optimal  throughput  depends  not  only  on  the 
mean  but  on  the  whole  distribution  of  new  packet  arrivals. 
Interestingly  enough,  it  can  be  proved  that  both  throughputs 
coincide  when  the  new  packet  arrivals  are  Poisson  distributed. 
Still  with  the  same  method  as  in  the  proof  of  Theorem  1 ,  it  can  be 
easily  shown  that  there  exists  a  retransmission  probability  that 
minimizes  the  drift  dn  at  state  n.  With  such  a  retransmission 
proba 
T(x) 
xn/n\ 

convenience.  It  can  also  be  proved  that  a  control  of  the  form  p„  = 
A/X„  yields  a  maximum  stable  throughput  T(A).  Since  sup,20 
T(x)  depends  on  the  whole  new  packet  arrival  distribution 
fX„)„2o,  this  result  is  not  as  conclusive  as  in  the  DFT  case.  This  is 
because  the  stability  region  X  <  sup,i0  T(x)  is  actually  given  in 
the  form  of  an  implicit  equation  in  X,  which  cannot  be  solved  in 
general  without  further  specifications  on  the  distribution  (X*),.^- 
For  instance,  this  stability  region  could  be  empty.  Consider,  for 
example,  the  usual  collision  channel  with  possibly  some  added 


bility,  the  system  with  IFT  rule  is  stable  for  X  <  sup^c 
and  unstable  for  X  >  sup,**,  T(x),  with  T(x)  =  e~x  £" 
£»n  X.C„+/,  where  we  have  defined  C0  =  0  for  notationaJ 


=  e-(*+X)  ^  £;  (*+x)\  (13) 

n  ■  1 

Thus,  in  this  case,  T(x)  depends  only  on  X,  and  to  clarify  the 
proof  below,  we  denote  it  by  Tx(x) 

7\(x)  =  t(x+\).  (14) 

Assume  that  t(x)  does  not  achieve  its  supremum.  Then  from 
Property  5  in  the  proof  of  Theorem  2,  we  have  Vc  =  C  =  limt_„ 
t(x)  It  follows  from  (14)  that  for  any  X  >  0,  lim,-.„  Tx(x)  =  C. 
Therefore,  for  all  X  >  0,  sup, so  Tx(x)  >  C.  Hence,  for  all  X  > 
0,  sup, 20  Tx(x)  =  sup, 20  t(x),  and  by  definition  of  vc,  we  Finally 
get  vc  -  sup,20  t(x).  Note  that  Tx  does  not  achieve  its  supremum, 
in  the  sense  that  if  there  existed  X  6  (0,  vc)  and  x-x  2s  0  such  that 
vc  =  Tx(Xx),  we  would  have  sup,20  t(x)  =  t(\  +  xx). 

Assume  now  that  t(x)  does  achieve  its  supremum.  there  exists 
x0  >  0  such  that  sup,a0  t(x)  =  t(x0).  Then  for  all  X  in  [0,  jc0]  . 
T\(x0  -  X)  =  sup,20  t(x )  >  sup, 20  Tx(x).  Thus,  for  all  X  €  [0, 

*o3 

sup  7\(x)  =  sup  t(x)=Tx(x0-\).  (15) 

xkO  xzO 

We  have  for  all  x  >  0  T(at)  5  x ,  therefore  sup,£0  t(x)  ^  Xq. 
Together  with  (15),  it  follows  that  for  all  X  €  (0,  sup,20  t(x)),  X 
<  sup, 20  T\(x),  and  therefore  vc  >  sup,a0  t(x)  =  »jc.  Since  from 
(14)  sup,20  Tx(x)  <  sup, 2o  t(x)  =  Vc  for  all  X,  we  get  vc  £  Vc  and 
finally  vc= J7e-sup,20  t(x).  Note  that  from  (14),  Tx  reaches  its 
supremum  too,  since  for  all  X  <  vc,  there  exists  X\  >  0  such  that 
T\(xx)  =  vc. 

Note  that  we  have  also  shown  in  this  proof  that  T(x)  reaches  its 
supremum  iff  t(x)  does,  which  means  that  Vc  can  be  achieved  with 
a  control  of  the  form  pn  =  A/X„  iff  vc  can.  □ 

in.  Optimal  Control  for  the  Multipacket  Channel 

It  is  assumed  from  now  on  that  the  users  do  not  have  access  to 
the  value  of  the  backlog,  so  the  problem  becomes  one  of  control  of 
the  Markov  chain  with  partial  state  information  provided  by  the 
channel  feedback.  We  build  a  backlog  estimate  S,  with  feedback 
which  is  such  that  Z,  =  0  if  slot  t  was  empty,  and  Z,  =0 
otherwise.  The  results  of  the  previous  section  strongly  suggest 
that  we  should  use  as  a  retransmission  probability  p,  =  A/S„ 
where  A  is  a  point  at  which  t(x)  achieves  its  supremum  (according 
to  Property  5,  A  is  assumed  to  be  finite).  We  show  that  the 
resulting  control  algorithm  achieves  the  optimal  maximum  stable 
throughput  Vc-  This  holds  provided  that  the  following  assumption 
on  the  reception  matrix  is  verified. 
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CO:  There  exists  0  >  0  and  B  such  that  for  all  n  5:  1,2;  .  en 
*  B. 

The  purpose  of  condition  CO  is  to  bound  the  probability  of 
having  large  numbers  of  simultaneous  successes.  Unbounded 
numbers  of  successes  per  slot  are  difficult  to  deal  with  because 
they  may  result  in  very  large  instantaneous  errors  in  the  backlog 
estimate.  Note  that  condition  CO  is  likely  to  hold  in  most 
multipoint-to-point  channels  because  of  practical  limitations  on 
the  receiver  capabilities,  and  that  it  is  verified  for  all  the  examples 
in  Table  I. 

Theorem  4:  Assume  that  there  exists  A  E  (0,  +  »)  such  that 
t(A)  =  sup,s0  f(x),  that  the  new  packet  arrivals  (A,)i *o  are 
exponential  type1,  and  that  condition  CO  holds.  If  a  <  0  and  0  < 
0  verify  the  following  two  conditions2: 

Cl:  0  >  X 

C2:  0(1  —  e~A)  +  T)c  -  X  +  ae~A  =  0 
then  the  control  algorithm  (cf.  the  control  laws  proposed  in  [15], 
[19],  and  [25]) 


A 


Fig.  1.  Drift  properties  (Proposition  1). 


S,+i  =  max  {A,  S,  +  a/(Z,  =  0)  +  jS/(Z,  =  5)} 

has  maximum  stable  throughput  equal  to  tjc. 

Proof  of  Theorem  4:  The  proof  is  based  on  the  method 
developed  in  [30].  The  idea  is  to  use  the  properties  of  the 
homogeneous  two-dimensional  vector  Markov  chain  of  the 
backlog  and  its  estimate  M,  =  (X,,  S,)  to  build  a  Lyapunov 
function  whose  drift  is  negative  in  the  first  quadrant  of  the  («,  s) 
plane  when  \  <  i)c  It  turns  out  that  this  fails  to  hold  in  two  cones 
of  the  state  space,  but  it  can  be  proved  that  the  7-step  drift  of  the 
Lyapunov  function  is  negative  for  some  integer  7,  and  that  this  is 
enough  to  ensure  that  M,  is  geometrically  ergodic.  It  follows  from 
Theorem  2  that  M,  is  nonergodic  if  X  >  ijc.  For  substantial 
portions  of  the  proof,  the  reader  is  referred  to  [9]  because  of  space 
limitations. 

Denote  by  X,  =  5,  -  X,  the  error  in  the  backlog  estimate.  The 
first  part  of  the  proof  mainly  consists  of  computing  and 
approximating  the  drifts  of  X,  and  X,  which  are  the  basic  building 
blocks  for  the  Lyapunov  function. 

Denote  by  c(n,  s )  =  E[X,+  l  -  X,\ Ms  =  (n,  s)]  the  backlog 
drift  at  state  ( n ,  s ),  and  by  d(n,  s)  =  E[X,+ 1  -  X,  \M,  =  ( n ,  s)] 
the  drift  of  the  backlog  error.  For  technical  reasons,  what  we  most 
often  use  in  the  proof  are  the  truncated  drifts,  which  correspond  to 
the  value  of  the  drifts  restricted  to  those  paths  where  the  variation 
in  the  backlog  is  bounded  by  some  integer  7,  that  is  c(n,  s,  7)  = 
£[(*,+  l-  X,)IQXI+1  -  X,\  £J)\M,  =  (n, $)]  and d(n, s,  J) 
=  E[(XI+ 1  -  X,)I(\Xl+l  -  X,\  sS7)|Af,  =  (n,  j)J.  Clearly, 
these  truncated  drifts  will  be  good  approximations  of  c(n,  s )  and 
d(n,  s),  respectively,  when  7  is  large.  It  will  tum  out  that  the 
drifts  depend  primarily  on  the  ratio  x  =  n/s  for  large  values  of  n 
or  s  Thus,  it  is  convenient  to  define  the  following  two  regions  in 
the  (/i,  s)  plane: 


C(Xo,  Xi)={(n,s) :  n>0,  s>0,  1 +  Xo=Sj£  1 +X| } 


Um=  {(n,  s)  :  n^M  or  s>M} 

where  Xo  and  X,  are  such  that  -  co<X<)SX|<  +°°.  The  aim 
of  the  first  part  of  the  proof  is  to  show  Proposition  1  below  which 
summarize:  all  the  properties  of  the  drifts  that  are  needed  for  our 
purposes  (see  Fig.  1). 


1  A,  is  exponential  type  if  there  exists  d  >  0  such  that  Ele**']  is  finite.  For 
instance,  this  is  true  if  A,  is  Poisson  distributed. 

1  Conditions  Cl  and  C2  define  half  a  straight  line  in  the  plane,  and  therefore 
an  infinite  number  of  possible  estimation  schemes,  all  of  them  yielding  the 
same  throughput. 


Proposition  1:  There  exist  y  E  (0,  1/5),  5  >  0,  and  an  integer 
70  >  0  such  that  for  all  7  >  7<>: 

i)  for  all  (n,s)  E  C ( -  57,  5y)  IT  UM,c(n,s)  <  -  5  and  c(n, 
s,  7)^-6  +  h(7); 

ii)  for  all  (n,  s)  E  C (  -  co,  -  y)  C\  UM,  d(n,  s)  <  -8  and 
d(n,  s,  7)^-5+  k(7); 

iii)  for  all  (n,  s)  €  C(7,  +  co)  D  UM,  d(n,  s)  >  6  and  d(n,  s, 
7)  >  5  -  V(J) 

where  v(J)  is  a  nonnegative  function  which  goes  to  zero  a  7  goes 
to  infinity. 

The  detailed  proof  of  Proposition  1  can  be  found  in  [9].  After 
computing  the  value  of  the  drifts 


c(0,  s)  =  \ 


(16a) 


c(n 


-5>  =  X“2  (  / 


\(  A\i 


A  \n-r 

1 —  )  Cj  (n>  1)  (16b) 


7(0,  s)  -  max  {A -s,  a}  -X 
(  A  \n 

d(n,  s)  =  j3-X  +  (max  {/1-s,  a}-0)  M  -  — 


n\{A\J  .  A  \n~J 


+5UA7 


1  — 
s 


(17a) 


Cj  (n>l)  (17b) 


we  work  out  upper  and  lower  bounds  by  truncating  the  sums  (16) 
and  (17)  to  a  fixed  number  of  terms,  and  then  we  approximate 
those  bounds  as  a  function  of  the  sole  variable  n/s.  The  mam  idea 
is  that  the  dynamic  behavior  of  the  Markov  vector  M,  =  (X,,  S,) 
depends  essentially  on  the  ratio  X,/S,.  For  instance,  if  x  is  nearly 
equal  to  1,  the  backlog  estimate  is  close  to  its  ideal  value,  and  we 
should  have  c(n,  s)  <  0  since  the  backlog  drift  is  negative  in  the 
perfect  state  information  case.  Also,  a  well-behaved  estimate 
should  be  such  that  if  x  <  1 ,  then  the  error  s  -  n  is  positive,  and 
therefore  should  have  a  negative  drift  d(n,  s)  <  0  (see  [15]).  In 
the  same  way,  we  expect  to  have  d(n,  s)  >  0  for  x  >  1. 

Let  us  define  the  following  Lyapunov  (unction; 


V(n,  s)  =  max  jn,  ( n-s ),  (s- 

where  the  constants  have  been,  chosen  so  that  V  is  continuous. 
V(n,  s)  is  equal  to  the  first,  second,  and  third  term  inside  the 
bracket  when  (n,  s)  is  in  C(-  37, 37),  C(37,  +  »),  and  C(  - 
-  37),  respectively.  Notice  that  Vh  defined  so  as  to  take  the  best 
advantage  of  the  drift  properties  listed  in  Proposition  1.  For 
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instance,  when  V(n,  s)  is  equal  to  n,  then  the  Markov  chain  M, 
belongs  to  C(-37,  37)  which  is  included  in  C(-57,  57)  where 
the  backlog  drift  is  negative  provided  that  either  n  or  s  is 
sufficiently  large.  Similar  comments  can  be  made  about  the  other 
two  regions.  Unfortunately,  this  does  not  enable  us  to  conclude 
that  the  drift  of  the  Lyapunov  function  is  negative  in  UM  because 
A/,+,  may  well  be  in  a  different  region  than  A/,.  However,  this 
change  of  region  becomes  unlikely  if  we  exclude  a  small  zone 
around  the  lines  *  =  1  ±  3y  where  V  changes  definition  and 
indeed  the  second  part  of  this  proof  consists  of  showing  that  the 
Lyapunov  function  has  a  negative  drift  in  the  remainder  of  the 
state  space. 

Proposition  2:  There  exist  A/0  a  0  and  S0  >  0  such  that  for  all 
N  a  A/0and  for  all  (n,  s)  E  UN  fl  [C(-  «,  -47)  U  C(-27, 
27)  U  C(47,  00)], 

£(K(A/,+  i)-  K(A/,)|A/,=(n,  s)]<  -60. 

Proof  of  Proposition  2:  We  consider  separately  likely  and 
unlikely  events 

E[V{M„x)-V(M,)\MMn,  *)1 

=  £[(K(Af,+,)-  F(A/,))/(|.A,-2,|  <.J)\Mt  =  {n,  s)] 

+  £[(K(A/(+t)-  k'(M))/(|-4f_2,| >J)\M,  =  (n,  s)J.  (18) 

We  start  by  showing  that  the  first  term,  which  corresponds  to 
likely  events,  is  negative  when  /  is  large  by  using  the  properties  of 
the  truncated  drifts  from  Proposition  1  and  a  simple  geometric 
result.  The  lemma  below,  whose  proof  is  in  [9],  gives  a  measure 
of  how  much  a  cone  C  (Xo,  Xi)  expands  if  each  of  its  points  is 
allowed  to  move  of  some  distance  that  cannot  exceed  B  in  absolute 
value  along  each  axis. 

Lemma:  Consider  -v  >  0,  B  >  0,  and  7  -  1  <  Xo-  <  Xi  < 

+  00;  and  assume  that  \n  -  n'  |  <  B,  |s  -  s'  \  <,  B,  and  Q  a  B/ 
7  (1  +  |Xi|)(X|  +  2  +_  7).  Then: 

1)  0 n ,  s)  e  c(Xo,  a)  n  uQ 

=>  (n'.s')  €  C(Xo-7.  «)  n  UQ.B 

2)  (n,  s)  E  C(-oo,  X,)  fl  Uq 

=»  (n' ,  s')  E  C(—  00,  X1  +  7)  H  Uq.b 

3)  (n,s)  €  C(Xo,  X|)  n  UQ 

=»  («' ,  s')  E  C(Xo~7,  Xi  +  7)  H  Uq.b. 

Set  B(J)  -  max  {/,  [«!  +  )3},  and  define  Q(J)  to  be  any  real 
such  that  Q(/)  >  max  {£(/)  +  M,  B(J)/y  (1  +  47)  (2  + 
37)}.  Wehave  |S,+,  -  S,|  s  [or{  +  (3  £  £(/),  and  if  \A,  -  S,| 
<  /,  then  [Xi+i  -  X,\  <  J  <  B(J).  From  the  lemma,  Q(J)  is 
such  that 

M,  E  C(~2y,  2y)  fl  Uq^j)  =>  M,+ 1  E  C(-3y,  3y)  H  Um 

(19) 

M,  E  C(4 7,  co)  fl  £/q(/)  =>  A /,+[  E  C(3y,  eo)  n  UM  (20) 
A i,  E  C(-»,-4 7)  D  Uq(J)  =»  A/,+  i  E  C(-co,  -37)  fl  UM 

(21) 

where  M  has  been  defined  in  Proposition  1.  Assume,  for  instance 
that  M,  belongs  to  C(-27,  2y)  fl  (%,).  From  (19),  M,+  \  E 
C(  -  3y,  37)  n  n  C(  -  57,  57)  n  UM.  Hence,  if  /  s  J0,  we 
can  apply  Proposition  1  i): 

£[(K( A/,+  ,)-  K(A/,))/(|/l,-S,|</)tiV/,  =  (n,  s)] 

=  c(n,  s,  J)<,  -5+v(J). 


If  M,  belongs  to  the  other  two  regions,  C  (47,  00)  n  or 
C(-  oo,  -47)  H  Uq{J„  a  similar  argument  holds,  using 
Proposition  1  iii)  and  ii),  respectively,  along  with  (20)  and  (21).  It 
follows  that  for  all  /  3:  J0  and  for  all  («,  s)  £  C/gyj  D  [C( -  00, 
-47)  U  C(-27,  27)  U  C(47,  00)] 

fUW+i)-  P(A/,))/(M,-2,|s/)|A/,  =  (rt,  s))<  -6 ,  +  v,(/) 

(22) 

with  5i  =  min  {1,  1- 37/37)6  and  r((/)  =  i>(/)  1  +  37/  37. 

To  deal  with  the  second  term  on  the  right-hand  side  of  (18),  we 
consider  the  further  decomposition 

E[(V(Ml+i)-V(Ml))I(\Al-2l\>J)\Ml«{n,s)) 

=  £[( K(A/,h)-  K(A/,))/(|/1,>S,  +  /)|A/,  =  (/i,  s)) 

+  £[(P'(A/,+  i)-  K(A/,))/(2,>/4,  +  /)|  A/,  =  (n,  s)l.  (23) 

Let  us  denote  by  7i(/i,  s,  J)  and  Tfn,  s,  J)  the  two  terms  on  the 
right-hand  side  of  (23).  The  first  term  Tt(n,  s,  J)  corresponds  to 
a  case  where  the  variation  in  the  backlog  is  bounded  below,  and 
can  be  shown  to  vanish  as  /  increases  by  using  the  sole  fact  that 
the  mean  arrival  rate  X  is  finite.  Consider  now  Tz(n,  s,  J ).  If  M, 
=  («,  s)  belongs  to  a  region  such  that  x  -  n/s  >  .v0,  then  xQ  can 
be  chosen  large  enough  so  that  if  M, + 1  belongs  to  C  ( -  00 ,  -  37) , 
then  the  error  in  the  backlog  estimate  which  results  from  the  large 
number  of  successes  just  compensates  the  initial  error  n  -  s>  0. 
On  the  other  hand,  when  M,  belongs  to  any  region  such  that  x  is 
bounded  above,  then  £[£,/(£,  >  J  )\M,  =  (n,  5)]  goes  to  zero 
uniformly  in  ( n ,  s)  and  T2(n,  s,  J )  can  be  dealt  with  by  using  the 
following  rather  crude  bound  for  the  variation  of  V: 

|F(A/,+  l)-F(A/,)j£max  1  ■  ^ ] 

•  (j«|+j3+K-S,|)££(l+|A,-S,|)  (24) 

where  R  is  some  positive  constant.  It  is  shown  in  [9]  that 

£[(K(A/,+  1)-  K(A/,))/(M,-S,|>/)|A/,=(n,  s)] 

s^(/)  +  ey(/i,  s)  (25) 

where  lim^-a,  i>2(J)  =  0,  and  6 j(n,  s)  is  a  nonnegative  function 
that  depends  on  /,  and  goes  to  zero  as  either  n  or  s  goes  to  infinity. 

By  using  (22),  (25),  and  the  decomposition  (18),  we  get  the 
desired  result  that  the  drift  of  V  is  negative  in  this  part  of  the  state 
space:  fix  an  integer  /m,n  such  that  /m,„  >  J0  and  that  for  all  J  > 
Jmn,  vfJ)  +  vfJ)  <  5[/3.  Then  from  (22)  and  (25),  we  have 
for  all  (n,  s)  E  UQUml>)  D  [C(-°o,  -47)  U  C(-2y,  2y)  U 

C(47,  00)], 

£T[K(A/t+,)- K(A/,)|A/,  =  (n,  s)]s  ~  5l  +  e,m>  .*)■ 

Then  we  can  choose  an  Mo  >  Q(/mm)  which  is  large  enough  so 
that  !„(«.  s)  <  5|/3  for  all  ( n ,  s)  in  U»0.  □ 

This  concludes  the  second  part  of  the  proof.  Unfortunately,  it  is 
not  always  true  that  the  drift  of  V  is  negative  outside  a  finite  subset 
of  the  state  space.  For  instance,  we  have  proved  that  in  the  case  of 
the  usual  collision  channel  with  Poisson  new  packet  arrivals,  there 
exist  constants  Btx  >  0  and  Mex  such  that  for  all  (n,  s)  €  UMfX  for 
which  x  =  1  ±  3y,  and  for  all  a  and  (3  verifying  C!  and  C2, 
£[K(A/,+1)  -  V(M,)\M,  =  (n,  s)]  >  Ba.  However,  discontinui¬ 
ties  around  the  lines  x  =  1  ±  37  cancel  out  when  one  waits  long 
enough,  and  in  the  last  part  of  this  proof  we  show  that  the  /-step 
drift  of  V,  E[V(M,+J)  -  V{M,)\M,  =  (n,  s))  is  negative  for 
some  integer  /. 

Proposition  3:  There  exist  //  >  0,  p  >  0,  and  M/>  0  such 
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that  for  all  (n,  s)  6  UM} 

E[V{M,+Jf)-  s)}<  -p. 

Proof  of  Proposition:  One  of  the  main  problems  in  dealing 
with  the  7-step  drift  of  V  is  to  control  the  changes  of  regions 
between  M,  and  M,+j.  To  this  end,  we  define  the  stopping  time 

ry  =  min  s> 0,  £  (/l,+*-2,+*)|>73  j  . 
v.  |jt-o  I  ) 

If  tj  >  7,  then  for  1  £  k  <,  7,  |A,+*  -  X,\  ^  73  and  \Sl¥k  -  S,\ 

<  7(|a|  +  j3).  Thus,  if  we  define  £'(7)  =  max  (7(|a|  +  j3), 
73},  and  Q'{J)  to  be  any  integer  such  that  Q'(J)  s  B'(J)  + 
max  {M0,  M}  and  Q’(J)  >  2B'(7)/7(1  +  9/27)(57  +  2), 
then,  still  assuming  that  tj  >  7,  we  get  from  the  lemma  for  0  <  k 

<  7 

A/,  €  C  (~a>,  O  Uq,(J) 


s 


=»  Muk  e  C(-C0,  -47)  n  UMq  (26) 

A/,  €  C  (-27  +  2, 27-*)  0  UQ,(J) 

-  A/l+*  €  C(—2y,  27)  fl  t/Mo  (27) 
A/,  €  C  +  | .  «>^  H  t/Q,(y) 

*  M,+*  €  C(47,  «)  0  (/Mo  (28) 

A7,  €  C  ^-47-|  ,  -27 n  Uqhj) 

-  A71+*  €  C(-5y,  -y)  D  UM  (29) 

m,  e  c  (27-5,47+3)  n  Uq,U) 

•  M+*  e  C(7, 57)  n  i/M.  (30) 
In  other  words,  we  have  partitioned  the  plane  into  two  zones 


Fig.  2.  If  M,  6ZNn  UQlj)  and  if  t j  a  J,  then  M,,i  belongs  to  the  region 
where  the  drift  of  V  is  negative. 


s 


!ig.  3.  If  Af,  6  Zp  O  Uq'(J)  and  if  tj  a  J,  then  M,,,  belongs  to  a  region 
where  two  properties  of  Proposition  1  hold. 


ZS=C 


u  c 


and 


as  follows: 

s)] 
j- 1 

=  £  £[£[K(A/,+*+I)-K(AC*)|A/,+*] 
*- 0 


ZP=C 


u  c 


J- 1 

•  I{rj*J)\M,  =  (n,  5))+%  E{E[V(M„kkl) 


k- 0 


Then  we  have  chosen  Q'{J )  such  that  if  M,  belongs  to  ZN  which 
is  slightly  smaller  than  the  region  in  which  the  drift  of  the 
Lyapunov  function  is  negative,  and  if  Tj  >  7,  then  the  Markov 
chain  remains  in  the  region  in  which  Proposition  2  applies  up  to 
time  t  +  7  (see  (26)-(28)  and  Fig.  2).  Q'(J )  is  also  such  that  if 
M,  is  in  ZP  and  if  tj  >  7,  then  up  to  time  t  +  7  the  chain  stays  in  a 
region  such  that  two  out  of  the  three  properties  of  Proposition  1 
hold  at  each  step  (see  (29),  (30),  and  Fig.  3). 

We  start  by  showing  that  the  7-step  drift  of  V  is  negative  at  («, 
s)  when  (n,  s)  belongs  <0  ZN.  We  decompose  the  7-step  drift  of  V 


-  K(M,+*)|A/,+*]/(t,<7)|M,=  (/i,  s)].  (31) 

Denote  by  (7,(7,  n,  s )  and  (72(7,  n,  s)  the  two  sums  on  the  right- 
hand  side  of  (31).  If  Tj  >  7,  then  (26)-(28)  hold,  and  therefore  we 
can  apply  Proposition  2 

(7,(7,  n,  s)<,  -J6oP[TjZj\M,=  (n,  5)).  (32) 

Let  us  now  show  that  Tj  <  7  is  indeed  an  unlikely  event,  the 
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probability  of  which  goes  to  zero  as  1/7  uniformly  in  ( n ,  s) 
P[rj<J\Mi  =  (n,  s)] 


since  max  { a ,  b}  -  max  {c,  d }  <  max  {a  -  c,  b  -  d }.  Then 
using  the  fact  that  max  {a,  b }  £  max  {0,  a  +  /}  +  max  {0,  b  + 
/}  - /for/ 2:  0,  we  get 


s  5)  P  j|s  04,+;-S,+/)|>73|M,=(n,s) 

k-0  L  |  /-0  I  -* 

sS  p  ["  X 

Ar»0  Ll«0  -i 

+2pi  ss,+i>7jiw(=(,».j)  . 

k-o  L/-o  J 


+  S  p\  2  W>73]a/, =(«,*)  . 

k-0  L/.O  J 

From  Markov’s  inequality  we  have 

.  7-1 

P[Tj<J\Mt=(n,s)]£Ti  2  (Ar+  I)X 

J  km  0 

+~  S  ££[SI+/|M=(«,*)J- 

^  k-0  1-0 

Denoting  by  Bn  an  upper  bound  on  the  sequence  /„(/>*),  it  follows 
from  Section  II  that  £[2(+/|A7,  =  (n,  s)]  =  £[£IS,+/|A/',.m]|M/ 
=  ( n ,  ^)]  <  £„  so  we  get 

P[T,<7|M,»(n,s)]£^^£^  (33) 

where  B,  is  some  positive  constant.  From  (24),  it  is  easy  to  check 
that  the  drift  of  Fis  bounded  by  some  positive  constant  By,  so  that 

n,  s)zJBvP[Tj<J\Ml  =  {n,  j)J.  (34) 

Considering  (31),  (32),  (33),  and  (34),  we  get 

j  E[V(M,+J)- F(Af,)|Af,  =  (n.  s)]<, -80J+(By+80)Br. 

Therefore,  there  exist  constants  /q  >  0  and  J,  >  0  sue!,  that  for 
all  J  >  J\  and  for  all  ( n ,  s)  G  0Q'{J)  D  ZN, 

E[V(Ml+j)~  V(M,)\M,=(n,  s)]£  -7/»,.  (35) 

We  now  proceed  to  show  that  the  7-step  drift  of  the  Lyapunov 
,  function  is  negative  in  the  remaining  part  of  the  state  space  ZP 

1  consisting  of  the  two  cones  around  x  =  1  ±  3-y.  This  is  done  in 

3  two  steps.  We  first  show  that  the  7-step  drift  of  V  restricted  to 
likely  events  {77  >  7}  goes  to  -  °°  as  7  increases,  and  then  we 
prove  that  the  7-step  drift  of  V  restricted  to  unlikely  events  {77  < 
7}  is  bounded  above  independent  of  7. 

Assume,  for  instance,  that  ( n ,  s)  G  C(y  -  y/2,  4y  +  y/2)  D 
Eq'U).  The  difficulty  here  is  that  V  can  take  two  possible  values, 
-  and  therefore  Proposition  1  cannot  be  used  directly.  If  77  >  7, 

I  then  from  (30)  M,+k  G  C  (7,  57)  f)  Um  for  0  £  k  <  7,  so  that 

*  V (A7/ 1. k)  =  max  {X,4.k<  (1  +  3y)/3y(Xt+k  -  S/+*)}.  There¬ 

fore, 

E{(V(Mn.j)~  V (M,))I (772  7)| M,  =  (rt,  s)) 

•E  [max  ^(+y,-^(^+y-S(W)j 

1  •  I(Tj<zJ)\M'=(n,  s)j 

-E  j^max  j*„  (7f,-S,)j 

|  ■  I{Tj7tJ)\M,=(n,s)^ 

<E  [max  ^XlkJ-Xt,~^-(-kl+j+Xt) j 

■  I(Tj>J)\M,  =  (n,  s)j 


X„j-X„~H-X,.j+Xt)\ 

■5T 


I(Tj>J)\Mr 


E[(V(MI+J)-  V(M,))I(jj^J)\Mt={n,  s)] 
££  Jjnax  [0,  Xl+J-Xr +5|  J-  j 

•  /(r/2:7)|A/l  =  (n,  s)j 
+  E  £max  [0,  ~~2~  (~X,kj+X,)  +  8i 
■  I(TjZJ)\M,  =  (n,  s)j 
-E  ^/(ry£7)|M,  =  («,5)l 


I(rj^J)\Mr 


where  5t  =  min  {1,  (1  —  37)/37}  has  been  defined  in  (22).  We 
show  that  the  first  two  terms  on  the  right-hand  side  of  (36)  are 
bounded.  Since  (33)  lim^oo  -  8\J/2P[tj  >  7]  =  -00,  this  will 
be  sufficient  to  prove  that  lim^o,  E[(V(M,+j)  -  > 

7)|A7,  =  (n,s)]  =  -cd.  Define  Wk  -  X,+k  -  X,  +  £71/2  and 
Pk  =  P/+*;  where  F,  is  the  sigma-field  generated  by  {A„  s  <  t  - 
1;  X„  s  £  /} ,  representing  the  history  of  the  process  (Af,),ao  up  to 
time  /.  To  prove  that  the  first  term  in  (36)  is  bounded,  we  show 
that  there  exists  <f>  >  0  such  that  (/*,  Pk)  is  a  supermartingale, 
with  Yk  =  e<,wkI(TJ  >  k).  We  need  .0  show  that  E[  Yk+  f  JF*]  < 
Yk,  which  is  equivalent  to 

E[e^xt+k+i-krk{k+in)iOi(Tj^k+[)\F^k) 

<  e^xl  +  k~  Xt  +  (*/2)S  |  )J  (r^  g.  k) 

since  I(tj  a:  k  +  1)  =  I(tj  >  k)!(Tj  2:  k  +  1),  and  1(tj  >  k)  is 
measurable  with  respect  to  Fnk 

/(ry>:A:)£[e^+t:+|-^/+*+*i''2)|F,+J</(ry>A:).  (37) 

Now  if  tj  >  k,  then  from  (30),  A/,+*  G  C( 7,  57)  fl  UM,  Lemma 
2.2  in  [11]  states  that  if  X  is  a  random  variable  such  that  |A’|  is 
stochastically  dominated  by  an  exponential  type  random  variable 
Z,  and  if  the  expectation  of  X  is  strictly  negative,  £1^1  <  -e, 
then  there  exist  two  constants  7?  >  0  and  p  <  1  such  that  E[e’’x] 
<  p  <  1 .  Hence,  there  exists  4>  >  0  such  that 

for  all  (n,  s)  G  C(~5y,  5y)  O  UM, 

E[e^xi+i~xi+i/z>\M,s‘(n,  s))<l  (38a) 

for  all  ( n ,  s)  G  C(-oo,  -y)  f)  UM, 

E(e**t+ 1  -'J‘W/2)| (n,  s)]  <  1  (38b) 

for  all  ( n ,  s)  G  C(y,  00)  n  UM, 

E ( e*l‘ ~ + 1 +  *t + i/2) | M, = (n ,  s)]  <  1 .  (38c) 

It  follows  from  (37)  and  (38a)  that  (Yk,  Pk)  is  a  supermartingale. 
Therefore, 

E[  Yj\P0]  =E[*wjI(tj>J)\ F,)<E{Y0\Po]=\.  (39) 

Finally,  considering  that  max  {0,  x}  £  l/<t>  e*x,  it  follows  from 
(39)  that  the  first  term  in  (36)  is  bounded.  Using  (30)  and  (38c),  it 
can  be  shown  with  the  same  method  that  the  second  term  in  (36)  is 
also  bounded.  Thus,  threre  exists  a  constant  BT  independent  of  7 
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such  that 

E[(V(M,+j-  V(M,))I(tj^ /)| M, = («,  s))£BT-J- 

The  case  (/t,  s)  €  C(-4y  -  y/2 ,  -2y  +  y/2)  fl  C/g-yjCanbe 
dealt  with  in  a  similar  way,  using  (38a)  and  (38b).  Therefore,  we 
have  shown  that  there  exist  nz  >  0  and  Jz  >  0  such  that  for  all  J 
s  Jz  and  for  all  (n,  s)  S  Yq-{J)  fi  Zp 

E[(V(MI+J)- V(M,))I(Tj*J)\M,  =  (n,  s)]< -Jn 2.  (40) 


Theorem  5:  Suppose  that: 

i)  the  number  of  new  packet  arrivals  per  slot  has  finite  second 
moment  E[A}\  <  +oo; 

ii)  there  exists  A  6  (0,  +oo)  such  that  t(A)  -  sup^i0 

iii)  CO":  there  exists  B  <  +»  such  that  for  all  n  >  1, 
Zj5.t  khnk  £  B. 

Fix  X  <  iic and  £  >  0  such  that  X  <  f  (/4£).  Choose  a  <  0  and 
0  >  0  such  that 

Cl':  0(eA(-  1)  =  ^~^^-  eAi-a 


It  is  shown  in  [9]  that  there  exist  a  constant  B  >  0,  a  function 
v}(J)  with  limy_„  v3 (J),  and  a  nonnegative  function  vj{M) 
depending  on  J  verifying  limw-M  vj(M)  =  0,  such  that  for  all  (n, 
s)  €  Uq‘(J)+m\  fi  ZP, 

E[(V(Mi+j)~  s)\ 

<B+v}(J)  +  vA  M{).  (41) 

We  are  now  ready  to  conclude  the  proof  of  Proposition  3.  From 
(40)  and  (41),  we  have  for  all  ( n ,  s)  6  Uq'(J)+m i  ^  ZP, 
E[V(MnJ)  -  V(M,)\M,  =  (n,  s))  <  B  -  +  v3(J)  + 

vj(M\).  Fix  an  integer  J/  >  max  {Ju  Jz}  such  that  for  all  J  £  Jf, 
B  -  Jfiz  +  W}(. J)  <  ~nz.  Then  for  all  <n,  s)  €  Vq-v, >+*/,  fi 
ZP,  we  have  E[V(MUJ/)  -  V{M,)\Mt  =  (n,  j)]  £  ~hz  + 
vj.{Mx).  On  the  other  hand,  we  also  have  from  (44),  for  all  (n,  s ) 

€  n  ZP 

E[V(M„ jy)-  V(M,)\M,  =  (n,  s)]£  -  Mi J/- 

Now  fix  Mi  large  enough  so  that  vj(M{)  £  n2/2.  Then  define  M, 
=  Q'Uf)  +  Mi,  and  p  =  min  {n2/l,  JjV i}*  □ 

We  can  now  conclude  that  (M,),* 0  is  geometrically  ergodic  for 
X  <  i/c  by  invoking  the  following  result. 

Theorem  (Hajek  [11]):  Let  {IV,}  be  a  sequence  of  random 
variables  adapted  to  an  increasing  family  of  a-fields  {£,}. 
Suppose  that  ffo  is  deterministic,  that  { W,,  F,}  is  exponential 
type,  and  that  for  some  e  >  0  and  a  >  0  we  have  El(fVl+l  -  fV, 

-  e )  I(fV,  >  a )  |F,]  £  0  for  all  /  2:  0.  Then  for  each  value  of 
W0  the  stopping  time  r  =  min  {f  >  0;  fV,  £  a} is  exponential 
type. 

Define  tV,  =  V(M,Jf)  and  a  =  M/  max  {1,(1  +  3-y)/3y,  (1  - 
3y)/3y}.  If  V(M,)  >  a ,  then  M,  6  U/,y.  From  (24)  and  CO 
(V{M,),  F,)  is  exponential  type  since  A,  is.  From  Proposition  3, 
we  can  apply  Hajck’s  result  to  our  system  to  conclude  that  t  = 
min  {/  >  0,  <  a}  is  exponential  type  for  any  initial 

state.  Since  V{Mt)  £  a  implies  that  X,  £  a  and  S,  £  c/(l  - 
3y),  it  follows  that  r'  =  min  {/  >  0,  XtJf  £  a,  and  S,jj  £  a/(  1 

-  3y)}  is  also  exponential  type  for  any  initial  state,  as  well  as  r" 

=  min  {/  >  0,  X,  £  a,  and  S,  <  a/(  1  -  3y)}.  Hnce,  it  follows 
from  [14]  that  (X„  S,)  is  geometrically  ergodic,  concluding  the 
proof  of  Theorem  4.  □ 


X-KAO-re-*-11 

c;:  - 

Then  the  control  algorithm 

A 


S,+i  =  max  {A,  S,  +  o</(Z,  =  0)  +  /3/(Z(  =  0)} 

is  stable. 

Proof  of  Theorem  5:  Let  us  state  first  Mikhailov’s  Theorem 
(cf.  [35]  for  an  exposition  of  this  result  and  its  application  in  the 
decentralized  control  of  the  conventional  collision  channel). 

Theorem  (Mikhailov  [19]):  Let  M,  =  (X„  5,)  be  a  homogene¬ 
ous  Markov  process  on  R*  x  R *  with  drifts 

(c(n,  s),  e(n,  s))  =  E[Mt+l-M,\M,  =  (n>  s)]. 

Suppose  that: 

i)  there  exists  B  <  +co  such  that  for  all  (/?,  s)  G  R*  x  RT, 
£[||M,+  |  -  M,y\Mt  -  (/j,  s)]  £  B\ 

ii)  for  all  ]/  £  (0,  + »),  the  drifts  ( c{n ,  n/\ ]),  e(n,  n/]/)) 
converge  uniformly  in  i/-  as  n  goes  to  infinity  to  (c( \p), 

iii)  the  limit  drifts  (c(^),  e(\p))  are  differentiable  on  [0,  +  oo), 
with  (e(0),  e(0))  =  lim^oo  (o(0,  s),  e(0,  s))\ 

iv)  there  exists  e  >  0  such  that  if  c{]/0)  =  ]/0  efi/'o),  then  c(]/0) 
<  -e. 

Then  M,  is  stable. 

Since  boft.  die  new  packet  arrivals  and  the  rows  of  the  reception 
matrix  have  finite  variance,  it  is  easy  to  check  that  condition  l)  in 
Mikhailov’s  Theorem  holds 

£[||M,+  I-M,||2|M,=(n,  s)] 

=E[(Xl+l-X,)2+(Sn.i-S,)2\M,  =  (n,  s)]. 

Now  £[(S/+.|  -  S/)2|M,  =  ( n ,  s)]  <  a1  +  fi2,  and  from  (2) 

£[(Jf,+  1-A))2|M(  =  (H,j)]s£[/l2]  +  F[r?|M(  =  (/i,s)]. 
From  CO'  the  variance  of  the  number  of  successes  is  also  bounded 


IV.  Stability  Proof  Via  Mikhailov’s  Theorem 

Mikhailov  [19,  Theorem  3]  has  recently  found  a  powerful 
sufficient  condition  to  guarantee  the  stability  of  a  Markov  process 
taking  values  on  R+  x  R* .  This  result  can  be  used  to  weaken  the 
sufficient  conditions  we  imposed  in  Section  III  and  obtain  a  much 
more  simple  proof  of  stability.  However,  the  form  of  stability 
used  by  Mikhailov  is  weaker  than  the  geometric  ergodicity  used  in 
Section  ID. 

Let  M,  be  a  discrete-time  Markov  process  taking  values  in  Y  £ 
Rn,  U{r)  =  {x  e  £n:||*||  <;  r },  and  rx(S)  =  min  {f  2:  0 :M,  € 
S|M0  =  x},  i.e.,  tx(S)  is  the  time  it  takes  to  reach  the  set  S  from 
x.  Then  we  say  that  the  process  M,  is  stable  if  there  exist  constants 
c!  and  c2  such  that  E[rx(U(r))]  £  c,  ||*||  +  c2  for  all  x  €  Y. 
Using  this  definition  of  stability  we  show  the  following  result 
which  is  analogous  to  Theorem  4. 


£[S2|M,=(n,s)] 


n-j 

ijk£B. 


It  follows  directly  from  (16)  and  (17)  that  the  limit  drifts  are 
given  by 


c('p)  =  \-t(A'p) 
e(^)=0-r(,a-0)e~A't', 

respectively,  for  €  [0,  +  »).  Uniform  convergence  to  the  limit 
drifts  follows  immediately  from  the  results  given  for  the  perfect 
state  information  case  (Property  4).  Also  it  is  clear  the  t{x)  is 
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differentiable  (see  (6),  where  0  <  C„  s  ri).  Therefore,  properties 
ii)  and  iii)  in  Mikhailov’s  Theorem  are  satisfied. 

In  order  to  check  property  iv)  note  that  if  =  £,  then  it 
follows  from  Cl'  that 

c(M  =  <!'oe(4'o)- 


1>  One-Dimensional  Kaplan’s  Condition:  Consider  the 
model  of  Section  II  with  a  control  scheme  p„  =  F(Xn),  and  the 
Lyapunov  function  F(x)  =  x.  To  check  Kaplan’s  condition,  it  is 
enough  from  [27]  to  show  that  the  downward  part  of  the  drift 
-D{i)  =  kPij.it  is  bounded  below.  For  /  >  1  and  1  <  k 
£  i  we  have 


But,  at  that  point,  c(ip0)  <  0  because  of  the  choice  of  |.  There  is 
no  other  root  of  the  equation  c(\p)  =  \pe(\p),  and,  therefore, 
property  v)  follows.  To  see  this,  note  tit  ?!  because  of  Cl ' ,  c(4>)  = 
^e{ip)  for  is  equivalent  to 


which  is  impossible  #  £  because  of  C2 ' .  □ 

It  can  be  shown  [9]  that  m$(X)  is  finite  for  all  nonnegative  X  and 
£,  and  therefore  the  set  of  control  laws  defined  by  Cl'  and  C2'  is 
nonempty.  Actually,  trie  set  of  control  laws  in  Theorem  4  is  a 
subset  of  those  in  Theorem  5  because  in  Theorem  5  we  can  choose 
£  =  I,  in  which  case  C2  is  equivalent  to  Cl '  and  Cl  is  more 
restrictive  than  C2'  because  X  >  m^X)  [9],  □ 

V.  Conclusion 


fl- 0  j-k+n  W 

After  a  change  of  variable,  it  follows  that 
DU)=  £  (  ‘  )  F(/)'(l -F(i))‘-j  £  Xn  £  ( *-«>«;.*■ 

(A-l) 

If  (C„)nal  is  bounded,  then  Kaplan’s  condition  holds  independent 
of  the  retransmission  policy.  Denoting  by  Bc  an  upper  bound  for 
(Cn)„£i,  (A-l)  becomes 

-z>(/)fc~s  ( ' )  Fmi-F(i)y-'Jj}  kcj 

J.l  W  „.0 


In  this  paper  we  have  investigated  the  properties  of  decentral¬ 
ized  control  algorithms  for  a  random  access  channel  with 
multipacket  reception  capability.  By  using  the  working  hypothesis 
that  the  users  are  aware  of  the  value  of  the  backlog,  we  have 
determined  the  best  throughput  achievable  by  any  such  protocol, 
as  well  as  a  simple  way  to  achieve  it.  The  optimum  throughput  has 
been  shown  to  be  given  by  the  maximum  average  number  of 
successes  per  slot  when  the  number  of  transmissions,  per  slot  is 
Poisson  distributed.  In  the  imperfect  state  information  case,  we 
have  shown  that  the  same  throughput  achieved  in  the  perfect  state 
information  case  can  be  achieved  by  using  in  lieu  of  the  true 
backlog,  an  estimate  of  the  backlog  computed  at  each  station  using 
binary  feedback,  and  we  have  used  this  estimate  to  derive  a 
control  scheme  which  is  optimal  in  the  sense  that  it  achieves  the 
optimal  throughput  determined  earlier.  This  is  true  provided  the 
reception  matrix  verifies  condition  CO,  which  puts  some  restric¬ 
tions  on  the  number  of  successes  per  slot.  By  using  Mikhailov’s 
result,  CO  can  be  replaced  by  the  weaker  condition  CO'.  In  this 
case  however,  geometric  ergodicity  was  not  ensured.  Note  that 
the  feedback  empty/nonempty  used  in  Sections  EQ  and  IV  may  be 
less  than  the  available  feedback  in  many  practical  situations,  but 
no  further  information  is  needed:  a  ternary  feedback  would  not 
shorten  the  proof  or  achieve  better  throughput. 

Finally,  let  us  mention  that  one  can  easily  modify  the  proof  of 
Theorem  4  to  show  that  a  similar  result  nolds  with  the  IFT  access 
rule.  More  precisely,  under  a  hypothesis  paralleling  those  of 
Theorem  4,  one  can  build  a  control  scheme  based  on  a  binary 
feedback  empty/nonempty  such  that  the  Marko  •  sector  ( X„  S,)  is 
geometrically  ergcdic  for  X  <  supx£0  T{x).  Using  Theorem  3,  it 
can  be  seen  that  the  maximum  stable  throughput  is  he  same  for 
both  access  rules  when  the  new  packet  arrivals  a,c  Toisson 
distributed. 


*  -  S  Q  W 0 -FUtf-JQts  -Bc.  (A-2) 

2)  Two-Dimensional  Kaplan’s  Condition:  Consider  now  the 
multipacket  channel  with  a  general  control  algorithm  (1).  Then 
( X„  S,)  is  the  Markov  chain  of  interest,  and  the  relevant  Lyapunov 
function  is  V(n,  s)  =  n.  We  prove  again  that  Kaplan’s  condition 
holds  provided  that  (C„)n£1  is  bounded.  From  [27],  it  is  enough 
also  in  this  case  to  show  that  the  downward  part  T(xr)  of  the 
generalized  drift  is  bounded  below,  with  T{x)  =  2 y/yw<y(x>Pxy 
(V(y)  -  V(x)).  Given  a  state  x  =  (/,  s),  we  have 


r(Jf)»  -S  P[X„^  =  i-r,  Sn+i  =  k\X„  =  i,  Sn=s ] 

rml  k 

i 

=  rPlXn+i  =  i-r\x„  =  i,  S„  =  s] 

rm\ 

which  is,  in  the  same  way  as  before 

T(x)=  -2  r  g  X„  2  (  J)  (FW(1  -F(j))-V« 

r.l  0  J.r+n  V / 

I  /A  J- 1  J 

-  -S  wo -F(s))-1 2  x„  2 

/■I  V  /  /■-«+ 1 

this  expression  is  similar  to  (A-l),  and  the  end  of  the  proof  is  the 
same  as  in  (A-2). 


Appendix 

Kaplan’s  Condition 

Consider  a  Markov  chain  with  denumerable  state-space  D,  and 
one-step  transition  probability  matrix  (/VWieo-  Let  V(x)  by  a 
Lyapunov  function  on  D.  Then  the  generalized  Kaplan's  condition 
holds  if  there  exists  a  positive  constant  B  such  that  for  all  z  £  [0, 
1[  and  all  x  6  D 

zVM-  2  PxyZ^^-BO-z). 

y<zD 
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Abstract — Optimum  decentralized  demodulation  for  asynchronous 
Gaussian  multiple-access  channels  is  considered.  It  is  assumed  that  the 
receiver  is  the  destination  of  the  information  transmitted  by  only  one 
active  user,  and  single-user  detectors  that  take  into  account  the  existence 
of  the  other  active  users  in  the  channel  are  obtained.  This  approach  is  in 
contrast  to  both  conventional  demodulation,  which  is  fully  decentralized 
but  neglects  the  presence  of  multiple-access  interference,  and  globally 
optimum  demodulation,  which  requires  centralized  sequence  detection. 
The  problem  considered  is  one  of  signal  detection  in  additive  colored  non- 
Gaussian  noise,  and  attention  is  focused  on  one-shot  structures  where 
detection  of  each  symbol  is  based  only  on  (he  received  process  during  its 
corresponding  interval.  Particular  emphasis  is  placed  on  asymptotically 
optimum  detectors  for  each  of  the  following  situations:  1)  weak 
interferers,  2)  CDMA  signature  waveforms  with  long  spreading  codes, 
and  3)  low  background  Gaussian  noise  level. 

I.  Introduction 

HE  conventional  approach  to  the  demodulation  of 
code-division  multiplexed  multiuser  digital  com¬ 
munications  is  to  demodulate  each  user  as  if  it  were  the  only 
user  in  the  channel.  The  multiple-access  capability  of  such 
systems  is  thus  achieved  by  using  complex  signal  constella¬ 
tions  that  exhibit  favorable  cross-correlation  properties.  (See, 
for  example,  [1]  for  a  description  of  conventional  multiple- 
access  demodulation  techniques.)  However,  recent  work  by 
Verdu  [2]  has  shown  that  substantial  performance  gains  can  be 
achieved  in  coherent  multiuser  systems  by  using  a  receiver  that 
takes  advantage  of  the  structure  of  the  multiple-access  interfer¬ 
ence.  For  examp'e,  this  approach  can  be  used  to  alleviate  such 
limitations  as  the  near/far  problem  in  the  direct-sequence 
spread-spectrum  multiple-access  (DS/SSMA)  format.  The 
performance  gains  realized  by  the  receiver  proposed  in  [2]  are 
achieved  by  the  use  of  simultaneous  sequence  detection  of  all 
users  in  the  channel,  a  task  that  requires  a  centralized 
implementation  and  a  high  degree  of  software  complexity  (for 
example,  the  decision  algorithm  required  is  a  dynamic 
program  (DP)  whose  complexity  is  0( 2K)  where  K  is  the 
number  of  users  in  the  channel).  Since  the  implementation 
costs  of  such  fully  centralized  detection  algorithms  may  be 
unacceptably  high  for  many  applications,  and  since  network 
security  restrictions  may  not  permit  the  distribution  of  all 
user’s  signaling  waveforms  to  all  demodulating  terminals,  it  is 
of  interest  to  consider  demodulators  that  lie  between  these  two 
philosophies  of  conventional  demodulation,  in  which  other 
users’  signaling  waveforms  to  all  demodulating  terminals,  it  is 
optimum  demodulation,  in  which  all  users  in  the  channel  are 
tracked  and  demodulated  simultaneously.  The  p.  formance 
results  obtained  in  [2]-[4]  indicate  that  an  attractive  compro¬ 
mise  in  practice  is  to  use  optimum  multiuser  demodulators  for 
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only  a  subset  of  the  active  users  and  simply  neglect  the 
presence  of  all  other  users.  In  order  to  take  advantage  of  the 
superior  performance  achievable  by  multiuser  detectors,  the 
subset  of  active  users  to  be  taken  into  account  at  the  receiver 
should  contain  all  the  users  whose  power  is  sufficient 
regardless  of  whether  their  messages  are  destined  to  that 
particular  location. 

In  this  pape  we  consider  the  case  of  full  decentralization 
where  the  receiv  ,r  is  constrained  to  track  and  lock  to  the  signal 
of  only  one  user,  but  unlike  the  conventional  single-user 
detector,  is  optimized  to  take  into  account  the  structure  of 
multiple-access  interference  in  making  decisions.  We  consider 
several  design  approaches  that  can  be  used  to  optimize  these 
structures  depending  on  the  amount  of  information  that  one  ,s 
able  to  assume  to  be  known  about  the  signature  waveforms 
assigned  to  the  interfering  users. 

This  paper  is  organized  as  follows.  In  Section  II,  we  wiil 
first  discuss  the  general  structure  of  optimum  decentralized 
demodulators  that  simultaneously  track  and  demodulate  a 
group  of  D  users  from  a  total  population  of  K  users  sharing  a 
common  communications  channel  where  D  s  K.  We  then 
consider  the  structure  of  optimum  single-user  detectors  (D  = 
1),  and  particularize  to  the  case  K  =  2  to  illustrate  this 
structure.  The  results  of  Section  II  are  for  general  antipodal 
signaling  formats  In  Section  III,  we  turn  to  the  development 
of  specific  results  for  single-user  demodulation  of  DS/SSMA 
transmissions.  In  this  modulation  technique,  which  among 
coherent  signaling  formats  is  of  particular  practical  interest, 
each  signature  waveform  consists  of  a  sequence  of  chip 
waveforms  whose  polarities  are  determined  by  a  binary  word 
assigned  to  each  user.  The  specific  structure  of  the  direct- 
sequence  format  allows  for  the  development  of  useful  approxi¬ 
mations  to  optimum  single-user  detection  which  are  asymptoti¬ 
cally  exact  as  either  the  length  of  the  spreading  codes  or  the 
signal-to-background-noise  ratio  (SBNR)  increases  without 
bound.  We  also  show  that  (with  K  =  2)  even  in  the  absence 
of  any  prior  knowledge  about  either  the  spreading  codes  or  the 
timing  of  the  multiuser  interference,  the  optimum  single-user 
detector  is  not  multiple-access  noise-limited  as  the  back¬ 
ground  thermal  noise  level  vanishes.  This  is  in  contrast  to  the 
conventional  detector,  which  can  incur  an  irreducible  error 
probability  even  in  the  absence  of  background  noise.  All  of 
these  results  for  DS/SSMA  require  only  that  the  chip 
waveform  (which  is  usually  common  to  all  users  in  a  given 
network)  be  known.  Thus,  these  techniques  can  be  applied  m 
secure  networks  where  the  distribution  of  one  user's  spreading 
code  to  other  users  is  not  desirable. 

In  Section  IV,  we  return  to  general  coherent  signaling 
formats  tc  consider  the  problem  of  optimum  single-user 
detection  in  the  presence  of  weak  unlocked  interfering  users. 
We  mode!  this  problem  by  assuming  that  the  multiple-access 
interference  is  multiplied  by  a  small  amplitude  factor  e.  We 
then  derive  an  expression  for  the  likelihood  ratio  statistic  for 
optimum  symbol  decisions  on  the  locked  user  that  is  of  the 
form  of  the  conventional  correlation  statistic,  modified  by  an 
e2  term  involving  signal  cross-correlation  functions,  and  then 
having  higher  order  terms  of  order  e4.  The  resulting  locally 
optimum  detector  correlates  the  observation  with  a  replica  of 
the  waveform  of  the  user  of  interest,  suitably  smoothed  to  take 
into  account  the  presence  of  multiple-access  interference. 


0090-6778/88/01 00-0050S0 1 .00  ©  1988  IEEE 


POOR  AND  VERDU:  SINGLE-USER  DETECTORS  FOR  MULTIUSER  CHANNELS 


5  l 


II.  Optimum  Decentralized  Detection  for  Multiuser 
Channels 

Throughout  this  paper,  we  consider  a  received  signal  model 
of  the  form 

r,  =  S,(b)  +  nn  -co<t<co  (2.1) 

where  {/t,;  -  oo  <  /  <  00}  represents  white  Gaussian  noise 
with  spectral  height  ,V0/2,  and  where  the  received  signal  S,(b) 
is  the  superposition  of  transmissions  received  from  K  separate 
asynchronous  users,  i.e., 

S,(b)=j)  £  bk(i)sk(t-iT-rk)  (2.2) 

*•1  i«  -At 

where  T  is  the  symbol  interval,  bk(i)  is  the  ith  symbol  of  the 
Arth  user,  rk  is  the  relative  delay  (modulo  T)  with  which  the 
Arth  user's  transmission  is  received,  and  sk(t)  is  the  signature 
waveform  assigned  to  the  Arth  user.  (It  is  assumed  thats*(f)  is 
zero  for  t  €  [0,  T].)  Note  that  (2iW  +  1)  is  the  number  of 
symbols  per  user  in  the  given  transmission,  and  b  denotes  the 
K  x  (2 M  +  1)  matrix  whose  (Ar,  /)  entry  is  bk(i). 

Suppose  that  we  wish  to  demodulate  some  group  of  D  users 
from  the  total  population  of  K  users  where  D  £  K.  For 
simplicity  of  notation,  we  assume  that  these  D  users  of  interest 
are  labeled  I -D.  Thus,  we  know  $|,  •  •  • ,  so  and  n,  •  •  • ,  r0, 
and  the  maximum  likelihood  demodulator  chooses  a  symbol 
matrix  bD  3  {£*(/);  k  -  1,  ••*,£)}"_„  to  maximize  the 
log-likelihood  function 


maximization  of  (2.3)  with  a  dynamic  programming  algorithm 
yielding  a  TCB  of  0(1/11)*.  (See  [2],  [3]  for  details  of  this 
analysis.) 

Unfortunately,  for  D  <  K,  the  decomposition  of  (2.3) 
necessary  for  a  dynamic  programming  solution  is  not  possible 
because  of  the  coupling  among  symbols  in  the  expectation 
term.  This  means  that  maximum  likelihood  sequence  detection 
is  not  generally  computationally  feasible  (its  TCB  is  exponen¬ 
tial  in  the  number  of  symbols  per  user)  if  all  users’  signaling 
waveforms  are  not  known  and  locked.  Thus,  in  considering 
decentralized  demodulators  in  multiuser  channels,  we  will 
restrict  our  attention  to  algorithms  which  demodulate  only  a 
singte  symbol  at  a  time,  i.e.,  we  consider  one-shot  detectors. 
We  also  will  restrict  attention  to  the  binary  signaling  case,  in 
which  bk(i)  6  { —  1,  +  1}  for  all  i,  k.  Extensions  to  general 
alphabets  are,  in  most  cases,  straightforward. 

In  the  sequel,  we  will  consider  the  case  of  full  decentraliza¬ 
tion,  i.e.,  single-user  detection,  which  can  be  modeled  by  the 
binary  hypothesis-testing  problem 

H0:  r,=Si (f)  +  S“'M  +  «,,  0 <t<kT 

H{:  r,=  -St(t)  +  S“A  +  n„  0 <f<T  (2.6) 

where  {n,\  -00  <  t  <  00}  is  the  white  Gaussian  noise  and 
where 

=  S  [bLksk«-Tk+T)+b«sk(t-Tk)),  0 st<T 
*-2 


r,Sf{bD)dt~  j*  [S?(bD)]*dt 
N0  J-°>  No 

-Flog  E  |exp  [r,-5f(60)]5^  dt 

-If  [Sf1]**]]  (2-3) 

No  J  -»  ) 

where 

5f(*o)=S  bk(i)sk(t -  iT-  Tk),  -00  <(<oo 

kml  k.-M 

(2.4) 

S^  =  S,(h)-Sf(6D)  (2.5) 

and  where  the  expectation  is  over  the  ensemble  of  all  unknown 
quantities  in  SfM,  including  delays,  symbols,  and  (possibly) 
waveforms. 

Note  that,  even  if  we  ignore  the  complexity  of  computing 


(2.7) 

with  bLk  and  bRk  denoting  the  Arth  user’s  bits  in  the  intervals 
[-  T  +  rk,rk)  and  [rk,  T  +  r*),  respectively.  We  also  assume 
that  the  receiver  is  coherent  with  user  1  so  that  {.S|(/);  1  G  [0, 
T)}  is  a  deterministic  wavefonn,  and  that  each  user’s 
signaling  waveform  is  of  the  form 

sk(t)  =  {2wk)w2ak(t)  cos  (a >f/+0*)  (2.8) 

where  wc  is  known.  We  assume  that  (wf772ir)  is  an  integer 
large  enough  so  that  integrals  of  2uc  components  can  be 
neglected. 

Optimum  (maximum  likelihood/minim<  m  error  probability) 
decisions  for  (2.6)  are  based  on  compari.-g  the  likelihood  ratio 
to  a  threshold.  With  this  in  mind,  v/e  give  the  following  result. 

Proposition  2.1:  Suppose  that  the  phase  vector  of  the 
interfering  users  6  =  (02,  •  •  • ,  0*)  is  uniformly  distributed  on 
[0,  2a-)*"1  and  is  independent  of  bk  =  (bj-,  bRk),  k  =  2,  *  •  • , 
K,  T  =  (r2,  •  •  •,  tk)  and  ( ak(t ),  t  E  [0,  T],  k  =  2,  •  ■  • ,  K ). 
If  the  dependence  of  || SMA  ||  on  0  can  be  neglected,1  then  the 
likelihood  ratio  for  (2.6)  can  be  written  in  the  following  form- 


exp 


f 4w'/2  "1 

[-RT  L  r’ma'w  “j 


E  Io{(pl(bk,  Tk ,  1  )  +  'i'\(bk,  t*))i/2)  exp  f  ^  rkj(bk,  bjt  rk,  T;))yj 
L*-2 _ _ x  j- 2 _ 

r  k  (  k~i 

E  n  /o((Pi»*.  “  0  +  'I'libk,  rk))wl)  exp  (  -  2  Ykj(bk,  bj,  rk,  tj) 

-k.  2  \  y-2 


(2.9) 


the  expectation  in  (2.3),  the  time-complexity-per-demodu- 
!ated-bit  (TCB)  of  brute-force  maximization  of  the  log- 
likelihood  function  is  CH\A\ D{2M* ^/DM\A |)  where  \A  \  is  the 
size  of  the  symbol  alphabet.  Thus,  unless  some  simpler 
algorithm  can  be  found,  simultaneous  maximum  likelihood 
sequence  detection  of  D  users  is  out  of  the  question  from  a 
practical  point  of  view.  For  the  particular  case  of  fully 
centralized  detection  (K  =  D),  it  turns  out  that  a  much  simpler 
algorithm  can  indeed  be  found.  In  particular,  for  this  case,  the 
expectation  term  in  (2.3)  disappears  (since  S^A  as  0),  and  the 
remaining  terms  can  be  decomposed  in  a  way  that  allows 


where  the  expectation  is  over  the  ensemble  of  bits,  delays,  and 
possibly  waveforms  of  the  interfering  users2  and  we  use  the 
notation 

2(wk)l/2  ct 

Pk(.bk,  Tk,  e)  =  — — —  <xk{bLk,  bRk,  t-Tk) 

N0  i  0  * 

’  M0-evv!/2fl,(0]  dt  (2.10) 

For  a  waveform  x  =  (x(r),  0  s  t  s  T),  ]|x||:  denotes  x\t )  dt 
2  f0(  )  is  the  modified  Bessel  function  of  the  first  kind  of  order  0,  i  e  ,  l0(x) 
=  I/2x  jjr  exp  (x  cos  6)  d$. 
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2(W't.)l/2  fT  2  K  rT 

Mbk,  t*)  =  Jo  MbLk,  bRk,  t—Tk)rq(t)  dt  (2.11)  =.~  J  w"2  [(rp(t)-2ew\nax{t) 


„  2 {wk\Vjyn  fj-  4  s 

Tk,Tj)-  J0  °tk(bj-,  b%,  t-Tk) 

•  ctjibf,  bf,  t-Tj)  dt 

(2.12) 

•  cos2  ((Oct+0i))otk(bk,  b%,  t  Tk) 

‘  cos  (dk  - <4-7*  -  0, )  -  (rq(t)  -  ew\na{  (/) 

•  sin  (2wcf  +  6, ))ak(bj; ,bRk,t-7k)) 

rp  (t)  =  'fir,  cos  (wct  +  d i ) 

(2.13) 

•  sin  (0*-«cr*-0|)]  dt 

rq(t)  =  'fir,  sin  (af t  +  ) 

(2.14) 

K 

=  Y  Pkibk ,  Tk,  e)  cos  (fik-UfTn-O i) 

<xk(b,  c ,  \)  =  bak(\+  T)  +  cak(\). 

(2.15) 

km  2 

Proof:  The  likelihood  ratio  for  (2.6)  is  equal  to  the  ratio 
of  expected  values  of  conditionally  Gaussian  a  priori  densities 
where  the  expected  value  is  taken  with  respect  to  all  random 
quantities  in  SMA ;  this  is  given  straightforwardly  by 


-Mbk,  Tk)  Sin  (0*-wfr*-0|) 


(2.19) 


LR- 


r  i  i 

exp 

1 

1 

~ 

1 

Sp 

L-ZE 

exp 

where  the  last  equality  follows  by  neglecting  the  integrals  of 
the  2uc  terms.  Equation  (2.18)  is  immediate  from  (2.19)  and 
the  result  follows.  □ 

Note  that  the  structure  (2.9)  consists  of  the  single-user  (K  = 
1)  correlation  statistic 


2  wj/2 

“aT 


{ T  rp(t)ax(t)  dt  (2.20) 


(“p  N 


M  ||SAM||2  +  — SMA) 
No  N0 


]} 


h  [-£ 


||5^||2+_<r+5„5^) 


■]) 


(2.16) 


where,  for  functions  X  and  y  on  (0,  T],  the  notation  {x,  y) 
denotes  x(t)y(t)  dt.  The  first  ratio  in  the  above  expression 
is  readily  shown  to  be  equal  to  exp  ((4(w1),/2//Vb)r  \t 
rp(t)ai(t)  dt).  Now,  neglecting  the  dependence  of  ||5AM||  on 
0,  we  have  for  every  b  and  r 

_L 

No 


l|sAM!l 


L*-2  \'V0  jm  2 


‘kj(bk,  bj,  Tk,  Tj) 


)j 


(2.17) 


used  by  conventional  single-user  receivers,  modified  by  an 
additive  correction  term  which  accounts  for  the  other  users  in 
the  channel.  Note  that  the  received  waveform  enters  this 
correction  term  through  the  sliding  correlation  statistics  of 
(2.10)  and  (2.11). 

The  simplifying  approximation  in  Proposition  2.1,  which 
states  that  the  energy  of  the  multiple-access  interference 
process  is  independent  of  the  carrier  phases,  is  certainly 
accurate  when  ucT  is  sufficiently  large  and  the  normalized 
(i.e,  unit  energy)  cross  correlations  between  the  interfering 
users  are  low.  We  assume  throughout  this  section  and  the 
following  one  that  this  independence  is  valid.  If  such  an 
approximation  is  not  assumed,  then  it  can  be  shown  straight¬ 
forwardly  that  the  multiplicative  correction  term  in  the 
likelihood  ratio  (2.16)  is  equal  to 


E  To  •••  C  CXP  [t  0*(bk,Tk,  1) 


k- 1 

cos  ctk-'Pki.bk,  Tk)  sin  ak- £  TkJ  cos  ( ak-aj )  |  da2  •••  daK 
_ /-2 _ 


52*  p2r  K 

"•  ]  exp  pk(bk>  Tk>  -1) 
L  k»2 


k- 1 

cos  ak  -  Mbk,  Tk)  sin  ak  -  Tkj  cos  (a*  -  a,)  da2  daK 

J-2 


(2.21) 


So,  it  remains  to  show  that  for  all  ( ak ,  bk,  rk,  k  =  2,  •  •  -,  K), 
we  have 


f2r 

'0 


t  exp  (r-exu  5AM) 

INo  J 


dd 2  •  •  •  dd a- 
(2  *)x~' 


=  f[  Jo  (0>l(bk,  Tk,  e)  +  *l(bk,  r*))1'2).  (2.18) 

2 

To  this  end,  we  note  that  the  following  sequence  of  equalities 
holds: 


—{r~est,  SMA ) 

No 

S  >>!■  <-rO 


•  cos  (coct-o)cTk+$k)  dt 


Several  variants  of  the  general  structure  of  (2.9)  are  of 
interest  and  will  be  considered  here.  One  such  variant  is  that  in 
which  the  modulation  waveforms  of  the  interfering  users 
{<7*(0;  2  £  k  <  K)  are  known,  and  the  remaining  unknown 
quantities  in  {$*(/);  2  £  k  -s  K)  are  all  independent  with  the 
data  bits  and  delays  uniformly  distributed  in  their  ranges.  In 
this  case,  the  expectations  in  (2.9)  reduce  to 


£{(•)}  = 


(47T- 


i'SjX-l 


•f  E{(-)\b',T}dr2  ■■■  drK  (2.22) 
Jio.n*'-1 

where  b'  =  ( b2 ,  ■  •  bk)  and  where  the  inner  expectation  is 
over  the  amplitudes.  Thus,  the  computation  of  this  likelihood 
ratio  is  of  exponential  complexity  in  K.  Moreover,  there  will 
be  a  further  substantial  computational  burden  in  computing  the 
(K  —  l)-dimensional  integral  corresponding  to  averaging  over 
the  relative  delays  t2,  •  •  ■ ,  tk. 
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of  practical  interest,  namely,  when  the  number  of  chips  is 
large  and  when  the  white  Gaussian  noise  level  is  low. 

A.  Optimum  Single-User  Detection  for  Long  Spreading 
Sequences 

To  study  the  large-N  behavior  of  the  likelihood  ratio  of 
Proposition  1,  we  first  define  the  following  functions: 

ie(X)»  \T  [rp(t)-ew\'iax(t))\Kt-\)  dt  (3.2) 

.  Jo 

<H\)=\\g(tm-\)dt  (3.3) 


Fig.  1.  Optimum  single-user  detector  (K  =  2).  Replicas  of  a2(+  ,  + ,  -  f) 
and  a2(  + ,  - ,  -/)  are  generated  by  the  blocks  corresponding  to  the  signal 
of  the  second  user. 


Fig.  1  illustrates  the  particularization  of  the  demodulator 
derived  in  Proposition  2. 1  to  the  two-user  case.  Note  that  the 
quadrature  component  of  the  input  is  used  and  that  convolu¬ 
tions  [required  to  generate  (2.10)  and  (2.11)]  and  nonlinear 
memoryless  operations  are  also  needed. 

For  K  >  2,  the  delay  integrals  do  not  appear  to  be 
obtainable  in  closed  form.  However,  even  if  they  could  be,  the 
exponential  (in  K)  complexity  of  I,b,  shows  that  optimum 
one-shot  single-user  detection  in  a  /f-user  channel  is  at  least  as 
computationally  burdensome  as  centralized  simultaneous  se¬ 
quence  detection  of  fully  locked  users.  However,  one-shet 
single-user  detection  does  not  require  tracking  phases,  delays, 
and  amplitudes  of  all  users,  and  thus  may  be  preferred  if  these 
quantities  are  not  stable  for  relatively  long  periods  of  time. 
Moreover,  and  perhaps  more  importantly,  Proposition  2.1 
also  applies  to  situations  in  which  the  modulating  waveforms 
of  the  interferers  {a*(0;  k  =  2,  •  •  • ,  K)  are  not  known.  This 
situation  is  the  norm  for  the  noncentral  nodes  in  many  practical 
radio  networks,  and  thus  the  centralized  detection  algorithm  of 
[2]  cannot  be  applied  to  such  cases  unless  the  receiver 
estimates  the  unknown  signal  cross  correlations.  Furthermore, 
as  it  is  shown  in  Section  III,  an  important  reduction  in  the 
complexity  of  computing  (2.9)  results  from  the  modeling  of 
the  modulation  waveforms  of  the  interfering  users  as  being 
signature  sequences. 

III.  Single-User  Detectors  for  DS/SSMA  Channels 

In  practice,  one  of  the  most  important  types  of  code-division 
multiple-access  systems  is  direct-sequence  spread  spectrum 
This  corresponds  to  the  particular  case  of  the  model  f2  1), 
t2.2;,  and  t2.8j,  in  which  the  &th  user’s  signature  waveform  is 
of  the  form 


dkO)=  S  ckit(t-iTc), 
1-0 


0</<  T 


where  1  is  a  signature  sequence  of  binary  (±  1)  digits, 

the  chip  waveform  is  nonzero  only  on  [0,  7*e],  and  the  chip 
duration  Tc  is  given  by  Tc  =  T/N.  In  many  DS/SSMA 
muitipomt-to-multipoint  channels,  it  is  frequently  reasonable 
to  assume  that  user  1  knows  the  chip  waveforms  of  users 
2-A,  but  not  the  specific  signature  sequences  they  employ. 
Since  these  sequences  are  usually  chosen  to  be  pseudonoise 
sequences,  it  is  reasonable  to  model  them  (from  the  viewpoint 
or  user  1;  as  independent  sequences  of  independent,  equiprob- 
able  binary  digits.  In  this  section,  we  apply  this  model  for  the 
interfering  users  in  the  likelihood  ratio  formula  of  Proposition 
2.1.  As  we  will  see  below,  this  affords  a  much  more 
manageable  form  for  the  likelihood  ratio  in  the  limiting  cases 


gf(X,  0)  =  £<>(X)  cos  sin  9  (3.4) 

where  the  parameter  e  takes  on.  the  values  +  1  and  -  1  and  g, 
is  abbreviated  as  g+  and  g_,  respectively,  in  the  remainder  of 
the  section.  Now  fix  r  E  (0,  T)  and  suppose  that  n  E  {1, 
•  •  N }  is  such  that  (n  —  l)7*c  <  7  £  nTc.  Then  define 


df=  f  b*  ckj-n 
kl  Wk  Ck].n+N 


j-n>0 
j-n<  0 


y  =  0,  •••,rV  (3.5) 


and  notice  that  ge(r  -  T  +  iTc,  8)  =  0  for  i  £  N  -  n  -  1 
and  g,(r  +  iTc,  6)  =  0  for  /  &  N  -  n  +  1  because  \f/(t)  =  0 
for  t  E  [0,  7*fj.  Then  it  follows  that 

Mo 

r ~Tn  t Pk(bk,  T,  e )  cos  d-\j,k(bk,  t)  sin  8] 


=  t  ck,[bLkgt(T-T+iTc,  d)  +  b»ge(T+iTc,  0)] 

f-0 

=  ckN-nbLkg'{j~nTc,  d)  +  ckN.nb^ge{T-nTc+T>  9) 


+  S  dkigc(T+U-n)Tc,  8) 

/-I 

=  tdK‘Se(r+(i-n)Tc,8)  (3.6) 

.  i-0 

and  thus,  the  distribution  of  7o((p|(6*,  rk,  e)  +  yi/\{bk,  rk))wl) 
is  the  same  modulo  Tc  when  r*  is  uniformly  distributed. 

Let  us  now  consider  the  particular  case  of  a  single  interferer 
K  =  2  which  may  also  be  used  to  approximate  the  situation  in 
which  we  have  a  single  dominant  interferer.  In  this  case,  the 
correction  term  of  the  likelihood  ratio  is  equal  to 


rc-p 

nr— 


2  w\n  "  \ 

-rrtd2ig+{iTc-\,  B))d$  d\ 

Mo  t l _ / 

2iv'/2  N  \ 

-J-tdvgMTc-X,  8))  d8  dX 

Mo  t  J 


where  the  expectation  is  over  the  independent  and  equiproba- 
ble  sequence  dy  E  {-1,1},/  =  0,  •  •  • ,  N.  The  integrands  in 
the  numerator  and  denominator  of  (3.7)  are  products  of 
hyperbolic  cosines  which  do  not  lend  themselves  to  further 
simplification.  However,  if  N  (the  number  of  chips)  is  large, 
the  distribution  of  the  discrete  random  variable  di,ge(iTc 
-  6)  approximates  the  normal  curve,  and  further  simplifi¬ 

cation  of  (3.7;  is  possible.  To  justify  this  approximation,  we 
show  that  for  each  0,  X  and  each  realization  of 

x(t)  =  [rPU)-ev/\nai(t)]  cos  8-rAt )  sin  8 
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such  that  sup  {|*(/)|/  €  (0,  T)}  <  oo,  the  following 
triangular  array  of  random  variables'5, 

i'lrv/i 

x(t-\)  dt  i=\,  (3.8) 

0-1)77/1 

satisfies  the  Lindeberg-Feller  condition  (e.g.,  [5]) 
lim  V  E[^m/en)lI{\^m\>ben}]  =  0  for  every  6>0 

ft—QO 

i-I 

(3.9) 

where  e*  =  £[£".,  s"2,].  To  check  (3.9),  first  note  that 


lim-S 

/T  — 00  l 


ri=lim 


v  Ar2(f")rv«2 


T  LJ 
1  (-1 


=  ||.Y||2  (3.10) 

where  t"  G  ((/  —  1  /n)T,  ( iT/n ))  and  the  first  equation  in 
(3.10)  uses  the  mean-value  theorem  on  the  integral  of  (3.8). 
Therefore,  for  every  e  >  0,  there  exists  n0  such  that  for  all  1 
£  i  £  n  and  n.  >  ri0, 


/{lfml>^}</^r2,>52(|M|2-6)j  .  (3.11) 

But  for  each  ft  >  0,  we  ,uii  find  je  'hat  for  n  >  zi|,  we 
have 


/  ^lfm|2>^/  \jn  sup2|x(/)|>)ij  =0.  (3.12) 

Hence,  only  a  finite  number  of  terms  on  the  left-hand  side  of 
(3.9)  are  nonzero,  and  since  lim„-.<„  fjj/e2  =  0,  (3.9)  follows. 

If  p  cos  9  -  \p  sin  9  is  a  Gaussian  random  variable,  then  it  is 
straightforward  to  check  that 


£/ft(Vp2  +  fa-)  =  exp 


-(£[p2]  +  £M2]) 
4 


/•  (^(^[P2]-^[W  +  ^2[P^])  •  (3-13) 

Hence,  using  the  Gaussian  approximation4  to  the  distribution 
of  E"0  dug'(iTe  -  X,  9),  the  correction  term  in  (3.7)  reduces 
to 

Cexp(i(2‘<x,+w,,)/»( 

where 

2e(X)  =  f)$20Tc-X)  (3.15) 

f-0 


*(M=2  Wf-X) 


(3.16) 


and 


<■0 


©e(X)  =  2  £  fc(iT0-X)*(/7*c-X).  (3.17) 

i-0 

This  structure  is  illustrated  in  Fig.  2.  Note  that  there  is 
considerable  simplification  in  this  structure  over  that  of  Fig.  1 . 
In  particular,  each  of  the  "  +  "  and  channels  involves 
chip  matched  filtering  of  the  in-phase  and  quadrature  compo¬ 
nents  followed  by  chip-rate  sampling,  quadratic  accumulation, 
memoryless  nonlinear  transformation,  and  integration  over  the 
offset  X.  This  latter  operation  can  be  implemented  in  parallel 
form  by  decimating  an  tVf/7>rate  sampler  (rather  than  a  l/7> 
rate  sampler)  where  M  is  the  number  of  points  taken  in  the 
numerical  computation  of  the  integral. 

Further  simplification  of  the  correction  term  (2.9)  is  also 
possible  in  the  case  K  >  2  by  using  the  Gaussian  approxima¬ 
tion.  We  can  obtain  an  expression  similar  to  (3.14)  where  the 
integration  is  now  over  the  hypercube  [0,  7;]**'.  Analogously 
to  the  case  K  =  2,  for  each  9  =  (02,  •  •  ■ ,  0K)  and  r  =  (r2,  •  •  • , 
tk),  the  distribution  of 
n 

^  Pk(bk,  Tk,  e)  cos  9k-\ pk(bk,  Tk)  sin 

*-2  k_x 

‘  0*~  S  Vkj{bk,  bj,  Tk,  Tj) 

is  approximately  Gaussian,  and  since  Tk)  is  un^orrelated  with 
P*.  fa,  Pj,  and  \f/j,  both  the  numerator  and  the  denominator  of 
the  correction  term  in  (2.9)  are  approximated  by 

I  •  •  •  1  drz  ■  •  ■  drkE 


K  k- 1 
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Now,  £exp  (-S£.j  Tkj(bk,  b,,  rk,  t,))  depends  only  on 
r  and  on  the  chip  waveform,  and  since  if  X  —  iV( 0,  a),  then 
E  exp  X  =  exp  a- 12,  we  have 

K  k- 1 


E  exp  I  -  J)  £  r kj(bk,  bj,  Tk ,  Tj) 


k *2 j* 2 
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=  exp 
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5  For  the  sake  of  notational  simplicity,  here  we  consider  the  case  of  a 
rectangular  chip  waveform.  In  this  case,  fw  =  dvg,(iTe  -  X.  6). 

*  It  should  be  noted  that  the  use  of  a  central-limit  theorem  here  is  quite 
different  from  the  Gaussian  approximations  used  in  many  previous  analyses  of 
conventional  single-user  receivers.  Here,  we  do  not  claim  that  the  multiple- 
access  interference  is  asymptotically  a  white  Gaussian  process,  however,  we 
do  show  via  the  Lindeberg-Feller  condition  (3.9)  that  the  decision  statistics  in 
(3.7)  arc  conditionally  Gaussian  random  variables  as  the  number  of  chips  per 
symbol  goes  to  infinity. 
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(3.18) 
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Fig.  2.  Correciion  statistic  for  single-user  detector  with  long  spreading 
sequences. 


where  the  last  approximation  follows  by  assuming  that  \J/(t)  = 
1/7I/2  for  t  E  [0.  Tc ]  and  by  neglecting  an  0(l/N2)  term  in 
[the  exponent.  Hence,  the  overall  correction  term  is  approxi- 
i  mately  equal  to 


N0  -*  0.5  Hence,  rather  than  using  (3.19),  we  must  take  the 
limit  as  No  0  of  the  original  likelihood  ratio  (2.9).  As  in  the 
previous  analysis,  we  will  First  focus  attention  on  the  case  of  a 
single  interferer  ( K  =  2). 


(  •  •  •  (  n  exP  (tFi  (H«.(t*)  +  *(t*)))  V(S,(r,)-$(rJ)2  +  e2+(r,) 

•Wd  ->10.71  \Nl _ )  \N\ _ 

(  •  •  •  ■(  11  exP  ( 5  (S-(r*)  +  *(r*»)  4  (~  V(S_  (7>)-4>(r*))2  +  02_  (r*) 

Vr,i  -V,!  t.i  \N20  )  \N 5 


*- 1 

n  exp 
/'-2 

~2wkwj  i 

f  \ric- Tj\2+ (Tc- \rk- tj  |)2  V 

drz  drK 

.  NNl  ' 

V  Tc  )\ 

k- I 

n  exp 

~2wkwj  > 

(  [t*  -  Tj\ 2  +  (Tc- \rk- Ty|)2  Y 

dr 2  •  •  •  dTfc 

.  NNl 

V  Tc  * 

(3.19) 


Notice  that  the  term  that  couples  the  integrals  in  (3.19)  is 
asymptotically  independent  of  r  as  N  oo.  Hence,  (3.19) 
approaches  the  product  of  K  -  1  (3  14)-like  terms  (substitut¬ 
ing  w;  by  »v*).  Thus,  in  this  limiting  case,  implementation  of 
the  multiuser  correction  term  in  the  likelihood  ratio  involves 
the  implementation  of  only  one  chip-matched-filter/quadratic- 
ficcumulator  section  followed  by  multiple  averaging  channels, 
fine  for  each  different  value  of  w*.  Fig.  2  shows  an 
implementation  of  the  correction  statistic  to  be  added  to  the 
output  of  the  single-user  matched  filter  in  the  case  of  a  single 
interferer.  The  general  structure  is  the  same,  except  that  the 
memoryless  nonlinearities  output  a  process  for  each  interferer 
which  is  then  passed  through  a  separate  logarithmic  integrator. 

Optimum  Single-User  Detection  for  High  SBNR 

l  We  now  turn  to  another  limiting  case  of  the  single-user 
detector  for  which  a  simplified  form  of  the  likelihood  ratio 
exists,  namely,  the  case  when  the  power  spectral  density  of  the 
additive  Gaussian  noise  goes  to  zero.  In  the  above  case,  we 
saw  that  when  the  rest  of  the  parameters  are  Fixed,  we  can  use 
a  Gaussian  approximation  as  N  -*  oo.  However,  for  fixed  N, 
the  error  between  the  expected  values  of  the  exponentials, 
ircording  to  the  true  and  Gaussian  distributions,  diverges  as 


Since  the  spreading  codes  of  the  interfering  users  are 
modeled  by  the  single-user  receiver  as  equiprobable  and 
independent  binary  sequences,  the  correction  term  of  the 
likelihood  ratio  is  given  by  (3.7)  and  the  log-likelihood  ratio  is 
(except  for  a  positive  multiplicative  constant)  equal  to 


No 

2wj/2 


2  \T rp(t)at(t)dt+ 


i'Tc  t’2r  /  2W1/1  \ 

log  £exp  (  ■—=-  dz,g+(iTc-\,  0)j  dO  d\ 


No 

2w|/2 


.os  n  r 

Jo  Jo 
(2  Hf2  " 


E  exp  2  d2lg-(iTc-X,  0)J  d6  d\  (3.20) 


5  This  is  due  to  the  fact  that  as  the  variance  goes  to  infinity,  the  error 
belween  the  distributions  accumulates  on  the  tails  (the  true  random  variable  is 
bounded)  on  which  the  expected  value  of  the  exponential  largely  depends. 
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where  the  expectation  is  over  the  independent  and  equiproba- 
ble  sequence  d2i  E  {  -  1,  I } ,  /  =  0,  •  •  •,  N.  On  taking  the 
limit  of  the  correction  terms  in  (3.20),  we  obtain 


..  'Vo 
urn  — — 

N0-0  2w\n 


/2wl/2  -v  \ 

£exp  g  d2lge(iTc-\,  6)J  dd  d\ 

[  a: 


=  lim  --rlog 
N0-0  w|/2 


'2t 
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j^exp  ^v]'2  |$e(/7f  -  X ,  0)!^)  °  d6  c/xj  '  °/_ 


-  >irTl  ~h  >°g  2 


No-°  2  w  }/2 

=  ■^75  log  sup  jexp  (\v\/2  j  |&(/7*C-X,  0)|^j 


=  (w2/wi)u2  sup  ^  \Se(‘Tc-\,  0)|. 
xeio.r.i  ,„0 

96(0.2r| 


(3.21) 


Therefore,  in  the  limit  as  A'o  -»  0,  the  optimum  single-user 
detector  for  K  =  2  in  the  case  of  unknown  interfering  codes 
compares  the  test  statistic 

{T  N 

2  1  rp(t)at(t)  dt  +  (w2/w,y*  sup  V  | gAiTc-\,  6) | 

°  xel°-7'<rl 

oeto.jii 


~(w2/w,)w2  sup  V  \g.VTc-\,d)\  (3.22) 
xeio.rfi  ,„0 

9€|0.2r| 

to  a  zero  threshold.  Note  that  as  might  be  expected,  (3.22)  is 
also  the  limiting  form  of  the  generalized  likelihood  ratio  test  or 
maximum  likelihood  detector  (see  Helstrom  [6,  p.  291],  for 
example). 

We  now  investigate  the  error  probability  of  the  test  in  (3.22) 
when  A'o  =  0.  It  was  shown  in  [4]  that  when  the  delays, 
phases,  and  waveforms  of  all  users  are  known,  the  fully 
centralized  optimum  detector  achieves  perfect  demodulation 
with  probability  l  in  the  absence  of  background  noise.  This  is 
a  nontrivial  result,  as  is  illustrated  by  the  behavior  of  the 
conventional  single-user  detector  which  becomes  multiple- 
access  limited,  i.e.,  the  limit  of  its  error  probability  as  A'o  -♦  0 
is  nonzero  for  sufficiently  powerful  interfering  users.  How¬ 
ever,  as  in  the  present  case,  the  conventional  detector  does  not 
have  access  to  the  delays,  phases,  or  signature  sequences  of 
the  interfering  users.  So,  the  question  arises  as  to  whether  an 
optimum  single-user  detector  can  achieve  error-free  perform¬ 
ance  regardless  of  the  energies  of  the  interfering  users  without 
knowledge  of  those  parameters.  The  answer,  in  the  two-user 
case,  is  given  in  the  affirmative  by  the  following  result  which 
does  not  put  any  restrictions  on  the  signature  sequences. 
Proposition  3.1:  Suppose  K  =  2  and  W|  >  0.  If  /■(/)  = 


bsi(t)  +  S',A(l),  b  E  {-1,  1},  then 
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(3.23) 


with  probability  1. 

Proof:  See  Appendix. 

In  the  general  case  of  K  >  2  users,  the  log-likelihood  ratio 
is  proportional  to  (cf.  (2.16)] 

2 vj\n  rp(r)a,(0  dl 
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dd  dr 


,3.24) 

where  the  expectation  is  over  the  independent  sequences  d  = 
{dk,  €  {-1,  1};  i  =  0,  •••,  A',  k  =  2,  K).  As  in 

(3.21),  this  expectation  is  dominated  as  A'o  0  by  the  atom 
corresponding  to  the  largest  integrand,  i.e., 

d *  E  arg  maxd  0 e{d,  t,  0)  (3.25) 

d 

where 


n e(d,  T,  9)  =  {r-esi,  S"'\d))-[-  ||S^(rf)||2  (3.26) 
and 


SSM«,  d)  =  %  J  dki(2wky'^(t-0-DTc-rk) 


(-0  k*2 


•  cos  (oj ct  +  9k-wcTk).  (3.27) 

Since  there  ur*.  2KIN*''1  possible  sequences,  it  is  necessary  to 
find  an  efficient  way  to  carry  out  the  maximization  in  t3.25j. 
But  (3  26)  and  (3.27)  have  the  same  structure  as  (2.3)  and 
(2  4).  respectively,  so  we  can  apply  the  results  of  (2]  to  carry 
out  the  maximization  of  (3.25)  with  linear  complexity  in  A. 
On  taking  the  limit  of  (3.24)  as  A'o  -*  0,  we  obtain  the  test 
statistic 

2<2(  (t)  dt  +  sup  Q*(t,  9)  —  sup fi*(r,  d)  (3.28) 

•>0  r.9  f.9 

where  ft*(r,  6)  -  Qe(d*,  r,  $).  Even  if  these  quantities  are 
obtained  through  efficient  dynamic  programming  recursions 
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as  in  [2],  the  main  computational  burden  of  (3.28)  is  the 
maximization  over  [0.  7',.]**1  and  (0.  2tt]k~1,  which  imposes 
severe  limitations  on  its  feasibility  for  even  a  moderate  number 
of  users.  However,  note  that,  in  performing  the  maximization, 
the  receiver  is  essentially  acquiring  the  chip  timing  and  carrier 
phases  of  the  interfering  users.  Thus,  in  practice,  it  would 
normally  be  unnecessary  to  undergo  a  full  search  for  the 
maximizing  r  and  0  in  each  symbol  interval  since  these 
quantities  will  change  little  from  symbol  interval  to  symbol 
interval.  For  this  reason,  (3.28)  might  be  reasonably  efficient 
to  implement  in  approximate  form. 

IV  Locally  Optimum  Single-User  Detectors  with  Weak 

INTERFERERS 

We  have  seen  in  the  preceding  sections  that  in  multiple- 
access  environments  with  many  users,  the  complexity  of 
optimum  detection  is  increased  considerably  (over  centralized 
reception)  when  the  unwanted  users  are  unlocked.  This  is  true 
even  without  sequence  detection  and  regardless  of  whether  the 
interfering  waveforms  are  known.  However,  one  of  the  main 
incentives  for  the  study  of  optimum  decentralized  detectors  is 
the  situation  in  which  all  or  some  of  the  interfering  users  are 
comparatively  weak,  so  that  it  may  be  impractical  to  provide 
reliable  synchronization  for  them.  The  objective  of  this  section 
is  to  derive  locally  optimum  (up  to  a  third-order  approxima¬ 
tion)  decentralized  detectors  for  reception  in  the  presence  of 
weak  unlocked  users.  As  we  shall  see,  such  detectors  can  be 
viewed  as  versions  of  the  detector  that  would  be  optimum 
without  the  weak  interferers,  modified  to  be  robust  against 
small  deviations  from  the  nominal  white  Gaussian  noise 
statistics  caused  by  weak  multiple-access  interference.  As  in 
the  preceding  sections,  we  consider  both  the  case  in  which  the 
waveforms  of  the  interfering  users  are  known,  and  the  case  in 
which  they  are  coded  with  binary  signature  sequences  un¬ 
known  to  the  receiver.  We  will  see  here  that  the  locally 
optimum  version  takes  care  only  of  the  nonwhiteness  of  the 
multiple-access  noise. 

The  approach  we  follow  to  derive  locally  optimum  decen¬ 
tralized  demodulators  is  to  obtain  an  asymptotic  form  of  the 
log-likelihood  ratio  for  signal  detection  in  contaminated  white 
Gaussian  noise  given  by  the  following  result. 

Lemma  4.1:  Consider  the  following  pair  of  statistical 
hypotheses: 

H0:  r,=s°t  +  efi'  +  n,  t  G  [tp,  tf) 

Ht:  r,=s)  +  €/i;  +  /t,  t  G  [tp,  tf]  (4.1) 

where  s1  and  s°  are  deterministic  finite-energy  signals,  {n,}  is 
white  Gaussian  noise  with  spectral  height  a 2,  and  {n„  t  E  [to, 
tf]}  in  a  symmetric  random  process  such  that  ||«||  <  B  (a.s.) 
for  some  constant  B,  and  whose  correlation  function  is 
denoted  by  C,,x  =  (t,  X)  G  (r0.  tf]1.  Then  the  log- 

likelihood  ratio  for  (4.1)  admits  in  the  following  expression: 

««-?  11  (O’ 

dt+0(e<). 


Proof:  Using  the  Cameron-Martin  likelihood  ratio  for¬ 
mula,  we  obtain 

log  LR (e)  =  log  (4-3) 


where 


D,(e)  =  E  j^exp  (^-^(V'  +  e/tll2 

-2  y  (si  +  en,)  dr,)j  /= 0,1  (4.4) 

where  the  expectation  is  over  the  ensemble  of  sample  functions 
of  {n„  t  E  [tp,  (/]}. 

In  order  to  derive  (4.2),  we  take  the  Taylor  series  expansion 
of  (4.3)  around  the  origin.  Since  n,  is  a  symmetric  random 
variable,  it  follows  that  Df  —  e)  =  Z>,(e).  and  hence  the  odd 
terms  in  the  Taylor  expansion  of  A(t)l<-oand  log  A(e)l<-oare 
equal  to  zero. 

Using  the  fact  that  ||n||  <  B  a.s.  and  the  Schwarz 
inequality,  it  follows  that  the  expectation  of  every  coefficient 
in  the  series  expansion  of  the  exponential  in  (4.4)  exists,  and 
we  can  write 


A(e)  =  A(0)  [l  +Y  £  J*  n,[r,-s\)  dtj 

-^||n||2J+0(€J)]  .  (4.5) 
Now,  since  log  (1  +  x)  =  x  +  0(x 2),  we  obtain 

log  =  log  + 1!  £  [( 1  ('r  n,[r,  -  s')  dt) * 
SA>(e)  A>(0)  2  |A*2\  ‘  ) 

-(“  +0(e<)  (4.6) 

and  (4.2)  follows  straightforwardly.  □ 

Notice  that  the  stringent  condition  ||/?||  <  B  (a.s.)  allows  a 
straightforward  proof  of  Lemma  4. 1  and  is  satisfied  in  the  case 
in  which  we  are  interested,  namely, 

2  t  -  IT- rky,  bk[i)E{-  1,1}. 


/.  -M  k.D*  1 


If  the  waveforms  {ak[t),  k  =  D  +  1,  •  •  •,  K}  are  known  by 
the  receiver,  then  the  autocorrelation  function  of  n  with 
support  in  (Fi2  (for  Si  =  co)  is  equal  to 

Caicos  (wc(/-\))  Yj  wkRk(t-\)  (4.8) 

*  L  _  n  1. 1 


Rk(t)-  \  ak(s-t)ak(s)  ds.  (4.9) 

jo 

If  the  waveforms  of  the  interfering  users  have  the  form  in 
(3  1)  and  the  code  of  each  user  is  unknown  by  the  receiver  and 
assumed  to  be  equiprobably  distributed  among  all  {-1,  1} 
s.quences  of  length  N,  then  the  autocorrelation  is 

C^=icos(a»c(/-X))  f  "**(t~V  (4-10) 

‘  Jt-D+I 

where  the  autocorrelation  of  the  chip  waveform  is  denoted  by 

no  =  !of  MOMs  -  o  dt. 

The  one-shot  single-user  detector  can  be  obtained  readily 
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from  the  result  of  Lemma  4.1.  Since  the  signal  of  user  1  is 
antipodally  modulated,  we  have 

5/,-S®=2V2H’|fl|(f)  cos  (gjc/  +  0|) 


and  (4.2)  becomes 


log  LR  = 


■  a.  2  vv*  (  dt 

L  No  T  *>o 


+  0(maxw£).  (4.11) 

Hence,  the  locally  optimum  one-shot  stngle-user  detector  is 
a  conventional  correlation  receiver  in  which  <2|(r)  is  replaced 

by  «,(/)  -  (l/.V,r)  2*„2  w* \T0  a{(\)Rk(t  -  X)  d\,  t  £  [0, 
T\,  i.e.,  the  pseudosignal  is  the  output  in  [7',  27"]  of  a  causal 
linear  filter,  driven  by  a\(t),  and  whose  impulse  response  is 
equal  to  5(t-T)  -  (l/N0T)  wkRk(t- 7").  If  the 
signature  sequences  are  unknown,  the  impulse  response  is  8(t 
—  T)  -  ( 1/NqT )  Sfm2  wkV(t  —  T),  which  amounts  to  a 
mild  smoothing  of  the  signal  replica  of  the  user  of  interest. 

The  locally  optimum  detector  that  locks  to  D  of  K  users  is, 
in  fact,  a  generalization  of  this  conclusion.  Using  Lemma  4.1, 
it  can  be  shown  (see  (3,  ch.  5]  for  details)  that  the  locally 
optimum  D- user  detector  is  a  centralized  detector  whose 
correlators  use  replicas  of  the  unmodified  waveforms  of  the 
users  of  interest.  However,  the  input  is  processed  by  a  causal 
filter  that  whitens  the  interference  due  to  unlocked  users,  and 
whose  impulse  response  depends  on  the  autocorrelation 
function  and  signal-to-noise  ratio  of  each  interfering  signal. 
This  requires  a  modification  of  the  DP  algorithm  to  account 
for  the  intersymbol  interference  introduced  by  the  prefilter, 
and  results  in  a  complexity  of  0(22B)  as  opposed  to  0(2°)  for 
the  corresponding  algorithm  that  neglects  the  additional  K  - 
D  interferers. 


V.  Summary 

In  this  paper,  we  have  obtained  decentralized  single-user 
detectors  which  take  into  account  the  presence  of  interfering 
users.  The  general  decentralized  demodulation  problem  is  one 
of  sequence  detection  in  additive  colored  non-Gaussian  noise, 
and  results  in  nonlinear  detectors  whose  decision  algorithms 
do  not  admit  recursive  forms  and  hence  are  more  complex  than 
their  centralized  counterparts.  Important  reductions  in  com¬ 
plexity  occur  when  attention  is  focused  on  one-shot  single-user 
detectors. 

The  general  form  of  the  single-user  likelihood  ratio  obtained 
in  Proposition  2. 1  is  equal  to  the  single-user  likelihood  ratio 
affected  by  a  correction  term  which  depends  on  both  the  in- 
phase  and  quadrature  components  of  the  input.  Both  the  case 
where  the  baseband  interfering  waveforms  are  known  and  the 
case  where  they  are  coded  by  an  unknown  signature  sequence 
have  been  studied. 

Under  the  assumption  that  the  assigned  waveforms  are 
signature  sequences  with  N  chips  per  bit,  we  have  obtained 
limiting  forms  of  the  correction  term  for  N  >  1  and  for  No  = 
0.  In  the  first  case,  the  correction  term  depends  on  the  received 
waveform  only  through  the  functions  Et(X),  4>(X),  and  Q.(\) 
which  represent  the  l2  norms  and  inner  product,  respectively, 
of  the  subintegrals  of  an  N  partition  (with  offset  X  £  [0,  TJ) 
of  the  in-phase  and  quadrature  components  of  the  received 
noise  process  under  both  hypotheses.  The  correction  term 
when  No  =  0  is  best  illustrated  in  the  single-interferer  case 
where  it  is  obtained  through  the  maximization  over  the  relative 
phase  and  delay  of  the  l\  norm  of  the  above  subintegrals.  It  has 


been  shown  that  this  detector  twhich  assumes  knowledge  of 
only  the  chip  waveform  and  energy  of  the  interfering  user) 
achieves  perfect  demodulation  in  the  absence  of  Gaussian 
noise  regardless  of  the  energy  of  the  interference,  thus 
avoiding  the  multiple-access  limitation  that  plagues  the  con¬ 
ventional  receiver.  Using  dynamic  programming,  the  single- 
user  detector  can  be  implemented  in  linear  time  in  N; 
however,  its  main  computational  burden  is  the  maximization 
over  [0,  7"c]Ar_l  and  [0,  2ir]K~'  needed  in  the  correction  term. 

Using  an  asymptotic  form  of  the  log-likelihood  ratio  for 
signal  detection  in  contaminated  white  Gaussian  noise,  we 
have  derived  locally  optimum  detectors  up  to  a  third-order 
approximation  in  the  amplitude  of  the  interfering  users.  The 
locally  optimum  one-shot  detector  has  been  shown  to  be  a 
single-user  correlation  receiver  which  uses  a  smooth  replica  of 
the  signal  of  interest  It  has  been  shown  in  [3]  that  this 
approach  can  be  generalized  to  the  case  of  partial  decentraliza¬ 
tion  ( D  >  1),  resulting  in  robustified  versions  of  the 

centralized  £>-user  receiver,  which  may  offer  substantial 
computational  savings  over  the  optimum  /f-user  receiver. 

Appendix 

Proof  of  Proposition  3.1 

We  assume  that  the  bit  transmitted  by  user  1  is  b  =  1 ,  the 
proof  being  identical  in  the  antipodal  case.  For  notational 
convenience  and  without  lost  of  generality,  we  suppose  that 
the  relative  delay  of  the  interfering  user  is  0  <  r2  ^  jTc;  then  it 
follows  that 


a2(^,  b\t  cW(f-/re+\2)  (A.l) 

J-0 

where  X2  =  Tc  -  r2,  d0  =  c2s- \b2,  and  di+i  =  c2ib?  for  /  = 
0,  N  -  1 .  Let  /3  =  9\  +  uct2  -  d2;  then  it  is  easy  to  show  that 

(  />(/)fl,(/)  =  w]/2+  w\n  cos  /3 

Jo 

•  jo  ai{t)a2(b^,  b§,t-T2)dt  (A.2) 
g+(iTc-\,  e)=w['2  cos  (0  +  j8) 

•  jora2(^,  b*,  t-nMt-iTc  +  \)dt  (A. 3) 

and 

g-{iTc-\,  0)  =  2iv[/2  cos  0 

•f  a\(t)\l/{t-iTc+X)  dt  +  g^(iTc-\,  0).  (A.4) 

Jo 

We  show  now  that 

sup  |g+(/Tc-X,  0)|  =  wj/2.  (A. 5) 

xeto.rci 
ne|o.2x| 

To  that  end,  using  (A.  1)  and  (A. 3),  we  obtain  for  every  X  £ 
(0,  Tc] 


sup  V  M/Tc-X,0)| 

*eio.:r|  “ 

=  <2 1  iS^f.  bf,  t-TM(t-iTt+\ )  dt\ 
1.0  I  ■ 
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=  w22  S  J  2  -JTe+hWV  ~  iTc+\)  dt 

l-O  I  J°  ;» 0 

-w2/2  J','  t  t  Mt-jTc  +  \i)\m-iTc+\) 


T  N 


=Sw"2  2  nt-jTc+h)dt 


jwO 


1/2 


i p2(t  —  iTc  +  \)  dt 


1/2 


1/2 


(A. 6) 


where  the  last  two  equations  follow  from  the  Schwarz 
inequality  and  from  the  relationship  f (\t-2( t  +  s')  +  t p2(t  — 
T(  +  s))  dt  =  Jo<  V'(t)  =  l/N,  0  £  s  £  7"C)  respectively. 
But  the  right-hand  side  of  (A. 6)  is  achieved  when  X  =  X2; 
hence,  (A. 5)  follows.  Consequently,  in  order  to  show  that  the 
sign  of  the  log-likelihood  ratio  is  positive,  one  has  to  prove 
that 


2wt  +  w2  +  vvj/2vvj'2  j"  2ai(t)a2(b^,  b$,  t-r2) 


•  cos  0  dt  -  tv"2  |g  .  (tTc  -  X ,  0) j  >  0  (A.7) 

i-0 

for  all  X  G  [0,  7V]  and  6  €  [0,  2n].  Using  (A. 3)  and  (A. 4),  we 
obtain 


2w,  +  vv2  +  tv]'2tv‘/2  |  2a\{t)a2(b (■,  b%,  t-r2)  cos  0  dt 
I  ”  IV2 2  S  \8-(‘Tc-'k,  d)\ 

I*  0 


-2w\  +  w2  +  w \/2  wl2/2 
■  \  2a\(t)a2(b^,  b^,  t-r2)  cos  0  dt 

V>0 

“  VVi/2  S  (  (2wj/2ff|(0  cos  9+  wj'2 

1  =  0 

•  a2(b b^,  t-r2)  cos  (6+0))i>(t-iT<:+\)  dt 


(A. 8) 

The  last  term  on  the  right-hand  side  of  the  above  equation 
’an  be  bounded  as  follows: 

]£  j  J  (2vvj/2a|(0  cos  0  +  w[/2ct2(b^,  b§,  (-t2 ) 

•  cos  (0+0))^(/-/Tc+X)  dt 

-S  \  |2‘v;/2a,(0  cos  6+w\/2a2(b^,  b\,  t-r2) 

,To  Jo 

■  cos  (0+0)HlK'-'Tf+X)|  * 

=  f  j2wj'2O|(0  cos  5+  w^'2a2(b(,  02,  t-r2) 

Jo 


•  cos  (0  +  0)1  g  |*(/-iTe  +  X)|  dt 


l-O 


j  (2wj/2fl|(f)  cos  C  + tv^/2a2(62 >  b2,  t-r2 ) 


cos  ( 6  +  0 ))2  dt 


1  1/2  T  fT  At 

J  [5*,?. 


\p2(t-iTc  +  \)  dt 


1/2 


=  4 vvt  cos2  6  +  w2  cos2  (0  +  0)  +  4>vJ/2wl/2 


tT  1  1/2 

cos  8  cos  (6  +  0)  a|(f)a2(6^,  b t-r2 )  <* 


(A.  9) 


Since  |j^ a,(t)a2(b^,  bf ,  t  -  t2)  <*|  <  1,  we  can  denote  j2- 
ai(/)a:(&£,  b$,  t  -  72)  eft  =  cos  a,  and  using  (A. 9),  the 
right-hand  side  of  (A. 8)  can  be  lower  bounded  by 


2iV|  +  w2  +  vvj/2wj/2  |  2a\(t)a2(b^,  b$,  t-r2 )  cos  0  dt 
|  (2  w  j/2cr,  (t)  cos  9  +  w)'2 
a 2(6J,  b§,  t  —  t2)  cos  (6  +  0M(t-iTc  +  \)  dt 


-<2S 

#■0 


>2tv(  +  w2  +  2wj/2  cos  a  cos  0 

-  tv^2(4W|  cos2  d+w2  cos2  (6  +  0) 

+  4w\/2wl2n  cos  6  cos  (8  +  0)  cos  a)l/2.  (A.  10) 

Now,  since  2w(  +  w2  +  2wj/2wJ/2  cos  a  cos  0  >  0,  the  sign 
of  the  right-hand  side  of  (A.  10)  is  equal  to  the  sign  of 

(2tV|  +  w2  +  2wj/2 wl2/z  cos  a  cos  0)2 

-[4vV|W2  cos2  0  +  tv2  cos2  (6  +  0) 

+  4w2wj/2wi/2  cos  6  cos  a  cos  (6  +  0)] 

=  (2w\  +  2w\,2wyl  cos  a  cos  0)2  +  4W|tv2(l  -cos2  8) 

+  vv|(l  -  cos2  (0  +  0))  +  4w2vv[/2,.v{/2 

•  cos  a  (cos  0-cos  6  cos  (0  +  0)] 

=  (2tv, +  2w[/2wJ/2  cos  a  cos  0)2  +  4n,1tv2  sin2  6 

+  vv^sin2  (6  r  0)  +  4w2w\/2w,2/2  sin  6  sin (9  +  0)cosa 

=  (2iV|  +  2wJ/2jv^/2  cos  a  cos  0)2 

+  (w2  sin  (0  +  0)  cos  a  +  2 Vtv,  w,  sin  0)2 

+  (w2  sin  (0  +  0)  sin  a)2.  (A. 11) 

Therefore,  we  have  shown  that  (A.  10)  and,  consequently, 
the  left-hand  side  of  (3.22),  are  nonnegative.  Moreover,  the 
right-hand  side  of  (A.  1 1)  is  equal  to  zero  only  if 

2wt  +  2w]/2h^/2  cos  a  cos  0  =  0,  (A.  12) 

but  since  0  =  6 *  +  wcr2  -  d2  is  uniformly  distributed,  (A.  12) 
occurs  with  probability  zero  if  h>i  >  0. 
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Abstract  — We  consider  an  asynchronous  code-division  multiple-access 
environment  in  which  the  receiver  has  knowledge  of  the  signature  wave¬ 
forms  of  all  the  users.  Under  the  assumption  of  white  Gaussian  back¬ 
ground  noise,  we  compare  detectors  by  their  worst  case  bit  error  rate 
in  a  low  background  noise  near- far  environment  where  the  received  en¬ 
ergies  of  the  users  are  unknown  to  the  receiver  and  are  not  necessarily 
similar. 

Conventional  single-user  detection  in  a  multiuser  channel  is  not 
near- far  resistant,  while  the  substantially  higher  performance  of  the  op¬ 
timum  multiuser  detector  requires  exponential  complexity  in  the  number 
of  users.  We  explore  suboptimal  demodulation  schemes  which  exhibit  a 
low  order  of  complexity  while  not  exhibiting  the  impairment  of  the  con¬ 
ventional  single-user  detector.  Attention  is  focused  on  linear  detectors, 
and  It  is  shown  that  there  exist3  a  linear  detector  whose  bit-error-rate 
is  independent  of  the  energy  of  the  interfering  users.  Moreover  it  is 
shown  that  the  near- far  resistance  of  optimum  multiuser  detection  can 
be  achieved  by  a  linear  detector.  The  optimum  linear  detector  for  worst- 
case  energies  is  found,  along  with  existence  conditions,  which  are  always 
satisfied  in  the  models  of  practical  interest. 


I.  Introduction 

THE  near-far  problem  is  the  principal  shortcoming  of  current 
radio  networks  using  direct-sequence  spread-spectrum  multiple- 
access  (DS/SSMAj  communication  systems.  Those  systems  achieve 
multiple-access  capability  by  assigning  a  distinct  signature  waveform 
to  each  user  from  a  set  of  waveforms  with  low  mutual  crosscorre¬ 
lations.  Then,  when  the  sum  of  the  signals  modulated  by  several 
asynchronous  users  is  received,  it  is  possible  to  recover  the  infor¬ 
mation  transmitted  by  correlating  _the  received  process  with  replicas 
of  the  assigned  signature  waveforms.  This  demodulation  scheme  is 
conventionally  used  in  practice,  and  us  performance  is  satisfactory  if 
two  conditions  are  satisfied:  first,  the  assigned  signals  need  to  have 
low  crosscorrelations  for  all  possible  relative  delays  between  the  data 
streams  transmitted  by  the  asynchronous  users,  and  second  the  pow¬ 
ers  of  the  received  signals  cannot  be  very  dissimilar.  If  either  of  these 
conditions  is  not  fulfilled,  then  the  bit-error-rate  and  the  antijamming 
capability  of  the  conventional  detector  are  degraded  substantially.  The 
reason  why  system  performance  is  unacceptable  when  the  received 
energies  are  dissimilar  even  with  good  (i.e.,  quasiorthogonalj  signal 
constellations,  is  that  the  output  of  each  correlator  or  matched  fil¬ 
ter  contains  a  spurious  component  which  is  linear  in  the  amplitude 
of  each  of  the  interfering  users.  Thus,  as  the  multiuser  interference 
grows,  the  bit-error-rate  increases  until  the  conventional  detector  is 
unable  to  recover  the  messages  transmitted  by  the  weak  users. 

Due  to  the  severe  reduction  of  the  multiple-access  capability  and 
the  increase  of  vulnerability  to  hostile  sources  caused  by  the  near-far 
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problem  and  its  ubiquity  in  networks  with  dynamically  changing 
topologies  (such  as  mobile  radio),  its  alleviation  has  been  a  tar¬ 
get  of  researchers  in  the  area  for  several  years.  However,  success 
has  been  very  limited  and  the  only  remedies  implemented  in  prac¬ 
tice  have  been  to  use  power  control  or  to  design  signals  with  more 
stringent  crosscorreiation  properties,  which  as  we  have  noted,  does 
not  eliminate  the  near-far  problem. 

The  viewpoint  of  this  paper  is  that  the  near-far  problem  is  not  an 
inherent  shortcoming  of  DS/SSMA  systems,  but  of  the  conventional 
single-user  detector.  The  optimum  multiuser  detector  was  obtained 
in  [1]  and  was  shown  to  be  near-far  resistant  in  the  sense  that  a 
(very  good)  performance  level  can  be  guaranteed  regardless  of  the 
relative  energy  of  the  transmitters.  The  optimum  multiuser  detector 
consists  of  a  bank  of  matched  filters  and  a  Viterbi  algorithm  whose 
complexity  is  exponential  in  the  number  of  users.  In  decentralized 
applications  (where  each  receiver  is  only  interested  in  demodulat¬ 
ing  the  data  sent  by  one  transmitter),  it  is  possible  to  drastically 
reduce  the  a  .nplexity  of  the  optimum  receiver  (.without  compromis¬ 
ing  performance)  by  neglecting  all  but  the  comparatively  powerful 
interferers.  However,  in  this  paper  we  propose  a  receiver  (which  we 
refer  to  as  the  decorrelating  receiver)  whose  complexity  is  only  lin¬ 
ear  in  the  number  of  users,  and  whose  bit-error-rate  is  independent 
of  the  powers  of  the  interferers  at  the  receiver.  Moreover,  the  decor- 
relating  receiver  achieves  optimum  near-far  resistance  (in  a  sense 
to  be  defined  precisely  in  the  sequel).  The  only  requirement  is  the 
knowledge  of  the  signature  waveforms  of  the  interfering  users,  and, 
in  particular,  no  knowledge  of  the  received  energies  is  required,  in 
contrast  to  the  optimum  receiver. 

This  paper  generalizes  the  results  obtained  in  [7]  in  the  case  of 
synchronous  code-division  multiple-access  channels.  Other  recent  at¬ 
tempts  to  derive  detectors  for  multiuser  channels  include  (9]-[l  U- 
The  multiple-access  channel  model  considered  in  this  paper  is 
spelled  out  in  Section  II,  as  well  as  the  general  structure  of  .the 
proposed  detector.  In  Section  III,  we  present  the  performance  mea¬ 
sure  of  interest,  the  near-far  resistance  and  we  show  that  the  near-far 
resistance  of  the  optimum  multiuser  detector  can  be  achieved  by  a 
linear  detector  (the  decorrelating  detector),  which  is  explicitly  ob¬ 
tained  in  Section  IV,  as  well  as  its  implementable  version  as  a  linear 
time-invariant  system.  Section  V  gives  a  numerical  comparison  of  the 
error  probabilities  of  the  decorrelating  receiver  and  the  conventional 
receiver  in  a  scenario  of  practical  interest. 

II.  Multiuser  Communication  Model 
Let  the  receiver  input  signal  be 

r(t)  =S(t,b)  +  n(t)  (2.1) 

where  n(t)  is  white  Gaussian  noise  with  power  spectral  density  a2 
and 

m  k  _ 

S(t,  b)=J2  ~ iT  ~  Tk)  {22) 

l  =  — ,v/*=i 

is  the  element  of  £2  (the  Hilbert  space  of  square-integrable 
functions)  which  contains  the  information  sequence  b  = 
{b(i)  =  [6i(/),-,W01,W/)€{-l,l},t  =  1  = 
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Fig.  1.  Example  of  signature  waveforms  which  can  violate  the  LIA. 

j  -M,---,M},  Sk(t)  is  the  normalized  signature  waveform  of  user 
!  k  and  is  zero  outside  the  interval  [0, 7*],  and  wt(t)  is  the  received 
energy  of  user  k  in  the  /th  time  slot.  Let  N  =  2M  +  1  be  the 
length  of  the  transmitted  sequence.  Without  loss  of  generality  it  is 
assumed  that  the  users  are  numbered  such  that  their  delays  satisfy 
0  < T|  <  <tk  <T.  The  normalized  signal  S(t,b)  is  the  re¬ 
ceiver  input  signal  corresponding  to  unit  energies. 

.  Define  the  vector  space  L  =  {x  =  fr(-Af),  ,x(Af)]  = 

•  -.cxicao.  •  xka) e as, 

|/r  =  1, /  =  -M, ■•■,M},  (each  dement  of  which  can  be 
equivalently  viewed  as  a  sequence  of  N  (/f*l)- vectors  or  as  one 
single  (/VA'*l)-vector),  and  define  the  (k,  t)th  unit  vector  «*•'  in 
L  with  components  uV(/)  =  Note  that  the  set  of  possible 
transmitted  sequences  o  is  a  subset  of  L,  obtained  by  restricting  the 
components  of  the  vector  x  to  take  on  the  values  ±  1.  Let  { ■ ,  • } 
-denote  the  usual  inner  product  on  <£i,  i.e.,  the  integral  of  the  product 
lover  the  region  of  support,  with  induced  norm  ||  ■  ||,  Henceforth,  we 
Intake  the  following  assumption  on  S(t,  b). 

1)  Linear  Independence  Assumption  (LIA): 


Vv  ez,,  v  ?£0=>  ||S(f,v)|j  #0.  (2.3) 

In  other  words,  no  matter  what  the  received  energies  are,  the  re¬ 
ceived  signal  does  not  vanish  everywhere  if  at  least  one  of  the  users 
jfias  transmitted  a  symbol.  This  condition  fails  to  hold  only  in  patho¬ 
logical  nonpractical  cases  with  very  heavy  crosscorrelation  between 
the  signals,  such  as  the  two-user  example  in  Fig.  1.  There  if  the  de¬ 
lay  between  the  users  is  T/2,  the  received  signal  can  be  identically 
zero  although  transmissions  have  been  made  (this  happens  if,  for  all 
i,  b2(i)  =  —6 1  (/)] .  It  is  shown  in  Appendix  II  that  such  a  situation 
will  arise  with  probability  zero  if  the  a  priori  unknown  delays  are 
uniformly  distributed,  which  is  the  case  in  the  asynchronous  channel 
nsed  by  noncooperating  users.  Basically,  in  order  to  violate  the  LIA, 
5  subset  of  the  users  must  be  effectively  synchronous  and  the  mod¬ 
ulating  signals  of  this  subset  have  to  be  heavily  correlated.  The  LIA 
will  be  in  effect  in  the  rest  of  the  paper.  If  it  is  removed  all  the  given 
results  can  be  generalized  in  a  manner  analogous  to  the  treatment  of 
the  synchronous  transmission  case  (7J. 

The  sampled  output  of  the  normalized  matched  filter  for  the  rth  bit 
of  the  Ath  user,  i  =  -M,  ■  •  ■ ,  M,  is 

|  riT +T +r* 

ydi)=  r(t)sk(t  -iT  -rk)dt  (2.4) 

JlT+r, 

fOQ 

=  /  S(t,b)sk(t  -iT  -rk)dt 


+  n(t)sk(t  -  iT  ~rk)dt 


where  the  second  equality  is  valid  since  the  signals  are  zero  outside 
(0,  T).  It  is  well  established  (e.g.,  [1])  that  the  whole  sequence y  of 
outputs  of  the  bank  of  K  matched  filters,  with  components  yk(i)  given 
by  (2.5),  for  k  =  1 ,  •  •  • , K ,  i  =  -M,  ■  ■  • , M,  is  a  sufficient  statistic 
for  decision  on  the  most  likely  transmitted  information  sequence  b . 
Jjhe  multiuser  demodulation  problem  which  needs  to  be  solved  at 
Me  receiver  is  to  recover  the  transmitted  sequence  6  ti  from  the 
Sequence  y  6  L.  Motivated  by  the  state  of  the  art— where  the  choice 
lies  between  the  optimum  multiuser  detector,  which  is  of  exponential 
complexity  and  the  ad  hoc  single  user  detector  whose  performance 
degrades  to  zero  for  sufficiently  high  interference  energy—  we  define 
a  class  of  simple  detectors  and  optimize  performance  within  this 
class,  to  obtain  an  acceptable  error  probability  versus  complexity 
Tradeoff. 


A  linear  detector  for  bit  i  of  user  k  is  characterized  by  v*1'  s  L. 
The  decision  of  the  detector  is  given  by  the  polarity  of  the  inner 
product  of  vk,>  and  the  vector  y  of  matched  filter  outputs,  which  is 
equal  to 

"  *  ,00  _ 

2j  =  /  ,S(/,  wb)S(t,  vk-‘)dt  +  nkJ  (2.6) 

U-MJ=\  J- oo 

=  (S(t,  wb),  S(t,  vkJ))  +  nu  (2.7) 

where  for  any  information  sequence  b ,  wb  will  denote  the  sequence 
of  amplitudes  wb  =  {[yWOMO,  • .  yStWMO],  i  - 
-M ,  •  •  • , M}.  nk>i  is  the  noise  component  at  the  output  of  the  cas¬ 
cade  of  matched  filter,  sampler  and  detector,  hence  is  a  Gaussian 
zero-mean  random  variable  with  variance  given  by 

£Ky]  =  2_,M/)v>(/)  /  a2sk(t -IT-  rk)Sj(t -iT  -  T,)dt 

tj  J -oo 

=  a2  j|S(/,  u*-')||2 .  (2.8) 

The  receiver  decides  on  the  fth  bit  of  the  Ath  user  according  to  the 
rule 


.  bk(i)  =  sgn  ]T  ^«*’'(/)^(0  (2.9) 

ta-M  ;=i 

=  Sgn((5(r,  wb),  S(t,  vkJ))  +  /!*,,).  (2.10) 

Wherever  it  is  clear  from  the  context,  the  superscripts  k,  i  will  be 
omitted. 

2)  Matrix  dotation.  It  is  convenient  to  introduce  the  following 
compact  notation.  Define  the  K*K  normalized  signal  crosscorrela¬ 
tion  matrices  R(l)  whose  entries  are  given  by 


/oo 

sk(t  -Tk)s,(t  +IT  -Tj)dt.  (2.11 

-oo 


Then,  since  the  modulating  signals  are  zero  outside  (0,  T] 

fl(/)=OV|/|>  1,  (2.12) 

R(-l)=Rr(l),  (2.13) 

and,  if  the  users  are  numbered  according  to  increasing  delays,  /?( 1) 
is  an  upper  triangular  matrix  with  zero  diagonal.  Also  let  W(l)  = 
diag([\/wi(/), •  ••,  y/w* (/)]).  With  this  notation  the  matched  filter 
outputs  for  /  =  {-M,  ■  ■  • , M)  can  be  written  in  vector  form  as  (cf., 


y(l)  =R(-l)fV(l  +  1  )b(l  +  1)  +R(OW(l)b(l) 

+R(l)W(l  -  iy»(/  -  1)  +n(l),  (2.14) 

as  can  be  seen  for  each  component  by  inserting  (2.1)  into  (2.4).  We 
adopt  the  convention  that  b(-M  -  1)  =  b(M  +  1)  =  0. «(/)  is  the 
matched  filter  output  noise  vector,  with  autocorrelation  matrix  given 
by 

E]n(i)nTU)i  =  o2R(i-j).  (2.15) 

The  entries  of  the  matrices  R(i),  i  =  -1,  0,  1  are  obtained  at  the 
receiver  by  cross-correlating  appropriately  delayed  replicas  of  the 
normalized  signature  waveforms  according  to  (2.11).  Note  that  no 
additional  complexity  is  hereby  required  of  the  receiver,  since  knowl¬ 
edge  of  the  normalized  signature  waveforms  and  the  capability  to 
lock  onto  the  respective  delays  are  necessary  for  matched  filtering 
and  sampling  at  the  instant  of  maximal  signal-to-noise  ratio 
In  contrast  to  (2.5)  the  asynchronous  nature  of  the  problem  is 
clearly  transparent  in  (2.14).  To  make  this  notation  more  compact 
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Fig.  2.  Equivalent  synchronous  transmitted  sequence. 


we  define  the  NK*NK  symmetric  block-Toeplitz  matrix  <R  and  the 
NK*NK  diagonal  matrix  W,  as  follows: 

(R( 0)  R(- 1)  0  0  \ 

/?(i)  R(  0)  /?(-i)  : 

=  0  /?(1)  R( 0)  0  »  (2-16) 

:  /?(- 1) 

\  o  0  /?(!)  J?(0)  / 

“W  =diag([v/w|(-A/),---,v/wA-(-A/),---, 

y/w,(M).  —  ,\/wK(M)])-  (2-17) 

In  this  notation  the  matched  filter  output  vector y  depends  on  b  via, 
from  (2. 14) 

y=(R'Wb+n.  (2.19) 

The  matrix  (R  can  be  interpreted  as  the  cross-correlation  matrix 
for  an  equivalent  synchronous  problem  where  the  whole  trans¬ 
mitted  sequence  is  considered  to  result  from  N*K  users,  labeled 
as  shown  in  Fig.  2,  during  one  transmission  interval  of  duration 
Tt  -  +tk  -  t | .  Then  the  results  presented  here  for  finite 
transmission  length  can  be  derived  via  analysis  of  synchronous  mul¬ 
tiuser  communication,  as  done  in  [7],  However,  the  approach  taken 
in  this  paper  is  more  general  and  gives  more  insight  into  the  nature 
of  the  problem.  The  limit  N  —  oo  is  considered  in  Section  IV-B. 

The  decision  made  on  the  rth  bit  of  the  fah  user  at  the  output  of 
the  detector  v  is: 


high  SNR)  of  a  transmitter  whose  bit-error-rate  curve  and  energy  are 
given  by  Pk(a)  and  w*,  respectively,  is 


t}k  =  lim 
0— *0 


e*(q) 

w* 


=  sup 


f 0  </■  <  1;  limP*(ff)/e 

I  a  --*0 


(3.1) 


where  the  last  equation  follows  immediately  upon  substitution  of 
P*(o)  by  its  expression  in  terms  of  the  effective  energy.  In  order 
to  visualize  intuitively  the  asymptotic  efficiency,  note  that  the  log¬ 
arithm  of  the  bit-error-rate  Pk(o)  decays  asymptotically  with  the 
same  slope  as  the  logarithm  of  the  bit-error-rate  of  a  single-user 
with  energy  jj*iv*.  Therefore,  if  lima—o  Pk(a)  >  0,  (i.e.,  there  is 
an  irreducible  probability  of  error  even  in  the  absence  of  background 
noise),  then  the  asymptotic  efficiency  is  zero.  Conversely,  nonzero 
asymptotic  efficiency  implies  that  the  bit-error-rate  goes  to  zero  (as 
(j—0)  exponentially  in  1/cr2. 

While  asymptotic  efficiency  and  low-noise  bit-error-rate  are  equiv¬ 
alent  performance  measures,  asymptotic  efficiency  has  the  advantage 
of  being  analytically  tractable  and  of  resulting  in  explicit  expressions 
for  the  detectors  we  are  interested  in.  For  example,  while  the  prob¬ 
ability  of  error  of  the  optimum  multiuser  detector  does  not  admit  an 
explicit  expression,  its  asymptotic  efficiency  is  given  by  [2) 


7?*-,  =  ^(0«^l|5(/’H’e)|12  (32) 

where  Z*  is  the  set  of  error-sequences  e  =  {e(/‘)  6  {-1,  0,  1}*,  /  = 
-M,  e*(/)  =  1 }  that  affect  the  rth  bit  of  the  &th  user.  It  was 

shown  in  [3]  (see  also  [15])  that  the  numerical  computation  of  the 
asymptotic  efficiency  of  optimum  multiuser  detection  given  by  (3.2) 
is  an  NP-complete  combinatorial  optimization  problem. 

In  an  environment  where  the  transmission  energies  change  in  time, 
e.g.,  if  the  transmitters  are  mobile,  a  performance  measure  of  interest 
for  any  detector  is  its  £th  user  near-far  resistance,  rjff,  which  is 
defined  for  each  detector  as  its  worst  case  asymptotic  efficiency  for 
bit  /  of  user  k  over  all  possible  energies  of  the  other  (interfering  and 
noninterfering)  bits,  i.e., 


Vk.i  =  inf  *,/•  (3.3) 

»;(/)>  0 

u.iyoc.i) 


bkO)  =  sgnvry  =  sgnvr((R'WZ>  +n).  (2.20) 

As  for  the  inner  product,  for  allx.y  in  L 

{S(t,x),Sft,y))=xT<Ryf  (2.21) 

It  can  be  seen  from  (2  21)  and  from  (2.3)  that  <R  is  positive  definite. 

III.  Near-Far  Resistance 

The  main  performance  measure  we  are  interested  in  is  the  bit¬ 
error-rate  in  the  high  signal-to-background  noise  region.  Thus,  even 
though  the  background  thermal  noise  is  not  neglected,  our  main  focus 
will  be  on  the  underlying  performance  degradation  due  to  multiple- 
access  interference.  This  performance  degradation  is  conveniently 
quantified  by  the  asymptotic  efficiency  which  was  introduced  in 
[l]-[2],  and  is  defined  as  follows.  Let  P*(< 7)  denote  the  bit-error-rate 
of  the  Arth  user  when  the  spectral  level  of  the  background  white  Gaus- 
sian  noise  is  <7:,  and  let  e*(cr)  be  such  thatP*(<7)  =  Q(^/e*(<7)/<7 ).' 

Then,  e*(<r)  is  actually  the  energy  that  the  ifcth  user  would  require 
to  achieve  bit-error-rate  Pk(c)  in  the  same  white  Gaussian  channel 
but  without  interfering  users.  Hence,  we  refer  to  ek(a)  as  the  ef¬ 
fective  energy  of  the  Arth  user,  and  the  efficiency  or  ratio  between 
the  effective  and  actual  energies  e*(<7)/w'/t  is  a  number  between  0 
and  1  which  characterizes  the  performance  loss  due  to  the  existence 
of  other  users  in  the  channel.  Thus,  the  asymptotic  efficiency  (for 

'  Q(x)  =  /~(1  lfZ^)eWlidv. 


In  our  definition  of  near-far  resistance  we  model  the  most  gen¬ 
eral  case  where  the  energies  of  the  users  are  allowed  to  be  time- 
dependent.  This  captures  the  worst  case  operating  conditions  of  the 
detector,  which  are,  for  example,  encountered  in  mobile  radio  com¬ 
munication,  due  to  positioning  and  tracking  variations.  In  the  case 
where  the  energies  are  constrained  to  be  arbitrary  but  nonvarying 
the  present  near-far  resistance  is  a  lower  bound.  That  case  is  not 
amenable  to  closed-form  analysis,  since  one  has  to  deal  with  a  com¬ 
binatorial  optimization  problem. 

For  illustration  consider  the  two-user  case.  If  the  user  energies  are 
constant  over  time,  i.e.,  w,(i)  =  w, ,  w2(i)  =  w2,  the  asymptotic 
efficiency  of  the  optimal  multiuser  detector  given  by  (3.2)  reduces  to 
[21: 


f  ^2  1 

=  mhW  1,  1  + - 2  max  {bi2|,|P2i|}- 

[  W\  1 


1  +2£;  ”2(M  +  M)-^ 


and  hence 


1? mm  =  min  i),  =min{l  -  p]2,  1  —  p|i  , 
w2 

Vi  const. 

,  2  .2  ,  (|Pt2 1  —  |P2I  I)2 

1  “P|2  -P2I  +  - j - 


j,  (3.4 
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achieved  by  the  linear  detector  v  for  the  hh  bit  of  the  fcth  user  is 


V- 


—  optimum  multiuser  cetector 

—  CONVENTIONAL  SINGLE-USER  OCTECTOR 


—  inf  min  S 
»j(b> o  «ez»  \ 
u.i)¥(k.n  11 

=  M  Iisa.jon2. 

Xit/)-! 


%/mo 


»?*.r(v)  = 


0,  min 

b  €R 
**<0=1 


(S(t,Wb),S(t,V)) 

l|S(/,v)|| 


«-{klKI}  f  ,/(|pi2  I+|p2i  I )  ^ 

Fig.  3  Asymptotic  efficiencies  in  the  two-user  case  for  infinite  transmitted 
sequence  length,  when  the  user  energies  are  constant  over  time  (here  we 
chose  |pu|,  1^,1  =03,  0.5). 

and  analogously  for  user  2  where  pi2  =  f?|2(0)  and  pi\  =  R t2(l). 
The  dependence  of  rji  for  constant  energies  on  the  energy  ratio  is 
shown  in  Fig.  3.  Note  that  the  opon.ul  multiuser  detector  is  near- 
far  resistant,  and  in  fact  has  an  asymptotic  efficiency  of  unity  for 
sufficiently  powerful  interference  ([2]).  Note  also  that  in  this  case 
three  different  error-sequences  minimize  (3.2)  for  different  values  of 
h-2/wi  ,  as  can  be  seen  from  the  discontinuity  points  of  the  derivative 
of  ij.  The  minimum  of  ij  over  constant  enerr  ss,  tj  m;n ,  is  an  up¬ 
per  bound  on  the  near-far  resistance  of  optnr  an  multiuser  detection 
ij,  which  is  the  minimum  asymptotic. efficiency  over  unconstrained 
energies. 

The  near-far  resistance  of  the  optimal  multiuser  detector  is  im¬ 
portant  since  it  is  the  least  upper  bound  on  the  near-far  resistance  of 
any  detector,  and  a  measure  of  the  relanve  performance  of  any  sub- 
optimal  detector,  from  (3.2)  and  the  definition  of  near-far  resistance 
it  is  equal  to 


jjk.i  -  inf  — rr-min  ||S(/,M'€)||2 

*,( d>o  M0«ez* 

U.Osfic.O- 


Knowledge  of  the  asymptotic  efficiency  of  a  linear  detector  is  equiv¬ 
alent  to  knowledge  of  the  worst  case  probability  of  error  over  the 
bit  sequences  of  the  interfering  users,  since  this  error  probability, 
which  is  a  Q-function,  is  set  equal  to  <2(\A*-  ,-(v)iv*(/)/<r)  to  obtain 

(3.10) . 

For  illustration  consider  the  conventional  single-user  detector  in 
the  two-user  case.  We  have  v  =  uk,i  (recall  that  uk,i  is  the  (k,  /)th 
unit  vector  in  the  space  L  of  linear  detectors).  If  the  user  energies 
are  constant  over  time,  i.e.,  W| (/)  =  W|,  w2(0  =  w2,  the  asymp¬ 
totic  efficiency  of  the  conventional  single-user  detector  is  found  from 

(3.10)  to  be 


i)\  =  max 


|o,  1  —  (|pu| 


+  kiD- 


and  analogously  for  user  2.  The  dependence  of  Tjf  for  constant  ener¬ 
gies  on  the  energy  ratio  is  shown  in  Fig.  3.  Note  that  the  asymptotic 
efficiency  of  the  conventional  single-user  detector  is  zero  fo:  suf¬ 
ficiently  high  interference  energy  (v/w2/v'vv7  >  1/Opu!  +  1P21  !))- 
This  implies  that  its  near- far  resistance  is  zero,  which  is  wtiat  we 
want  to  remedy. 

There  are  three  quantities  of  interest  in  this  communicaticn  envi¬ 
ronment,  on  the  one  hand  the  transmitted  bit-sequence  and  the  set  of 
energies,  both  of  which  depend  only  on  the  transmitters  and  deter¬ 
mine  the  operating  points  for  the  receiver,  and  on  the  other  hand 
the  data-processing  detector  v  at  the  receiver,  which  we  called  a  lin¬ 
ear  detector.  In  determining  which  linear  detector  to  choose  at  the 
receiver  a  useful  proced  e  is  the  minimax  approach,  in  which  the 
design  goal  is  to  optim.  he  worst  c?‘  e  performance  of  the  receiver 
over  the  class  of  operating  points.  Thus  we  are  interested  in  finding 
the  maximin  linear  detector,  whose  worst  case  performance  over  all 
allowable  input  sequences  is  the  highest  in  the  class  of  linear  detec¬ 
tors.  The  following  result  quantifies  the  performance  of  the  maximin 
detector,  in  the  sequel  denoted  by  v*. 

Proposition  1:  There  exists  a  linear  detector  (which  is  indepen¬ 
dent  of  the  received  energies)  that  achieves  optimum  near-far  resis¬ 
tance  (i.e.,  the  near-far  resistance  of  the  optimum  multiuser  detec¬ 
tor).  • 

Proof:  Ftom  (3.10)  the  asymptotic  efficiency  of  the  linear  de¬ 
tector  v  is 


In  Section  IV,  we  obtain  a  closed-form  expression  for  (3.7)  as  the 
reciprocal  of  tile  (Ar,  /)th  diagonal  element-(see  footnote  2)  of  the 
inverse  of  (R.  Hence,  the  near-far  resistance  of  optimum  multiuser 
resistance  is  guaranteed  to  be  nonzero  because  of  the  linear  indepen¬ 
dence  assumption  of  (2.3),  which  ensures  that  (R  is  invertible. 

We  now  mm  to  the  performance  analysis  of  the  linear  detectors 
introduced  above.  The  probability  of  error  at  decision  upon  M0  of 
the  linear  detector  v  is,  from  (2. 10): 

P*0)=P(&k(i)?bk(i))  (3.8) 

=  P((S(t,wb),S(t,v))  +  nkJ  <0|M0  =  1).  (3.9) 

The  equality  follows  since  the  hypotheses  +1,-1  are  assumed 
equally  likely.  Let  B  be  the  set  of  possible  transmitted  sequences. 
From  (2.8)  nkj  is  a  zero-mean  Gaussian  random  variable  with  vari¬ 
ance  ff2||S(f,  v)||2,  hence  the  probability  of  error  in  (3.9)  is  a  sum  oi 
Q- functions,  one  for  each  possible  interfering  bit-combination.  For 
o  -♦  0  the  Q- function  with  the  smallest  argument  dominates  the  er¬ 
ror  probability,  hence  from  (3.1),  since  the  expression  below  can  be 
shown' (cf.  [15])  to  be  upper  bounded  by  1,  the  asymptotic  efficiency 


n*,/(v)  =  — —  max2  <  0,  mm 
I  »<=a 
1  m»=i 


(S(t,wb),S(.t,v)) 

l|5(/,v)|| 


x  min  — —  max 
4  6B  M0 


=  mm  — ttt  max' 
ses  M0 
**<o=i 


where  in  the  last  equality  v/e  have  used  the  compact  matrix  notation 
of  (2.21)  for  simplicity.  We  are  interested  in  the  linear  detector  with 
the  highest  worst  case  asymptotic  efficiency,  i.e,,  whose  near-far 
resistance  is 


i?  *,/(u*)=  sup  inf 

vQL  wj0)> 0 

|5</,v)J  ,<0 

=  sup  inf  min  — —  max2 

v-et  w/(')>o  sea  M0 

.r«W0  (j.t)¥(k,i) 


fn  jb7’'W<Rv'| 


500 


IEEE  TRANSACTIONS  ON  COMMUNICATIONS,  VOL.  38.  NO.  4  APRIL  1990 


=  sup  inf  max2 
v£L  r€L 
/< Rvyt)  >*<')=' 


yr(Rv  } 
vV^CRv  j 


=  max 


sup  inf 
-st.  y^ 
■r<Hvyo 


yr<Ry 

%4>r(Rv 


(3.17) 


(3.18) 


where  we  have  set  yj(l)  =  bfl)s/w /(/)/  v/w*(/)  for  the  third  equal¬ 
ity.  Let  A/^.y)  denote  the  penalty  function  >,r<Rv'/vv7'(Rv  where 
the  first  argument  is  from  the  set  of  detectors  and  the  second  from 
the  set  of  operating  points,  both  specified  in  (3.18).  We  show  in 
Appendix  1  that  M(y,y )  has  a  saddle  point,  i.e., 


sup 

v  £L 


inf 


-6i 


yr(Rv 

\4>r(Rv 


inf 


yP.L 

>*(>)=! 


sup 
-S  L 
vT<S{v^0 


yr<Rv 
s/vT<Rv ' 


(3.19) 


which  establishes  the  existence  of  v  *  and  hence 


’?*./(v*)  =  max2  < 


„  .  c  yr<Rv 

0,  inf  sup  -== 

y^L  v£L  vvr(Rv 


>  (3.20) 


:  max2  <  0,  inf  \/yr<Ry 

*  y<cL 
ytO)= i 

=  inf  ||S(f,y)||2 

y€L 

I 


(3.21) 

(3.22) 

(3.23) 


where  the  second  equality  is  obtained  in  (A.  1),  the  third  line  follows 
since  (R  is  nonnegative  definite  and  the  last  equality  was  obtained  in 
(3.7).  0 

The  reason  why  the  near-far  optimum  linear  receiver  achieves  the 
same  near-far  resistance  as  the  optimum  receiver  can  be  understood 
as  follows.  Let  Cl  be  the  set  ofmultiuser  signals  modulated  by  all 
positive  amplitudes,  i.e.,  Cl  —  [S(t,y),  y  €  L)  and  let  E  denote  the 
subset  of  Cl  such  that  the  amplitude  of  the  fth  symbol  of  the  £th  user 
is  fixed  to  1,  i.e.,  E  =  {S(/,y),  y  €L,  yk(i)  =  1}  (note  that  E  is 
a  convex  set,  and  because  of  the  LIA  it  does  not  include  the  origin). 
Since  the  penalty  function  in  (3.18)  is  invariant  to  scaling  of  v  and 
the  operator  (R  is  positive  definite,  (3.18)  can  be  rewritten  as 


Vk.i(.v *)  =  max*  <  0, 


sup  inf  (S(t,y),S(t,v)) 

»ei  y&L 


(3.24) 


:  max*  <  0,  sup  inf  (y,  u) 

<,6fi 

1-1=' 


(3.25) 


Therefore  the  &th  user  deconelating  filter  can  be  viewed  as  the  unit- 
norm  multiuser  waveform  whose  minimum  inner  product  with  the 
elements  of  E  is  highest.  But  since  E  is  a  convex  set,  that  signal  is  a 
scaled  version  of  the  closest  vector  in  E  to  the  origin  (Fig.  4),  and 
its  near-far  resistance  (cf.  (3.22)]  is  the  norm  squared  of  that  vector. 
But,  as  (3.7)  indicates,  the  square  of  the  distance  from  E  to  the  origin 
is  precisely  the  near-far  resistance  of  the  optimum  detector. 

Equation  (3.7)  leads  to  a  nice  intuitive  interpretation  of  near-far 
resistance.  Rewrite  this  equation,  using  the  definition  of  S(t,  •),  as 


2 


»/*./  = 


inf 


\sk{!  - iT  -  7>)  +  -IT  - r/)J 

y. /;/(*,/> 


(3.26) 


Fig.  4.  Interpretation  of  near-far  resistance.  Vector  in  boldface  corresponds 
to  deconelating  filter. 

Letting  (y/(/)}  vary  over  the  admissible  set,  the  second  term  above 
generates  all  points  of  a  linear  subspace  which  includes  the  origin, 
therefore  the  infimum  in  (3.26)  is  the  distance  of  sk(t  -  iT  -  rk)  to 
this  space,  i.e., 

Wj  =d1(h0 -IT -rk),  span  {§j(t  -IT  - rj),  {J,  /)  ^  (k,  /)})  , 

1  (3.27) 

where  d(a,  b)  denotes  the  Euclidean  distance  between  the  JSj  ele¬ 
ments  a  and  b.  In  the  synchronous  case  because  the  time-support  is 
disjoint,  the  infimum  in  (3.26)  is  achieved  when  yj(l)  =  0,  /  i, 
and  (3.27)  reduces  to 

Vk  =  d2(sk(t),  span  {Sj(t),  j  f  A:}),  (3.28) 

i.e.,  the  Arth  user  near-far  resistance  in  a  synchronous  channel  is  the 
square  of  the  distance  of  the  kth  user  signal  to  the  space  spanned 
by  the  signals  of  the  interfering  users.  Viewing  the  asynchronous 
problem  in  terms  of  the  equivalent  synchronous  system  with  N*K 
users  and  period  NT,  the  near-far  resistance  of  asynchronous  com¬ 
munication.  allows  for  the  same  interpretation.  Note,  however,  that 
the  shifted  versions  sk(t  -IT  -rk),  l  9^  /  of  the  Ath  user  signal  affect 
the  near-far  resistance  of  the  rth  symbol  of  user  k. 

The  following  section  characterizes  the  linear  detector  that  achieves 
the  optimum  near-far  resistance  anticipated  by  Proposition  1 . 

IV.  The  Decorrelating  Detector 

We  first  assume  N  to  be  finite,  as  in  the  case  in  all  communica¬ 
tion  environments,  and  characterize  the  linear  filter  vhich  achieves 
the  near-far  resistance  of  optimum  multiuser  detection.  This  filter  is 
nonstationary  for  finite  N.  The  limit  as  /V  —>  00  is  then  considered, 
yielding  a  stationary  noncausal  limiting  filter,  and  hence,  after  ap¬ 
propriate  truncation  of  the  noncausal  part,  an  approximation  of  the 
near-far  optimal  linear  filter  which  can  be  implemented  easily 

A .  The  Finite  Sequence  Length  Case 

Definition-.  A  decorrelating  detector dk,‘  for  the  rth  bit  of  the  Alh 
user  is  a  linear  detector  for  which 

<Mk-‘  =  uk-‘  (4.1) 

or  equivalently,  from  (2.21),  ( S(t ,  v),  S(t,  dk,i))  =  uk(i),  for  all  v 
in  L. 

Existence:  By  the  LIA,  statement  (4.2)  below  holds  for  all  k,  i. 
Hence,  the  following  equivalences  show  the  existence  of  the  decor¬ 
relating  detectors  for  each  bit  of  each  user. 

V  v  e  L  with  ujt(i)  ^  0:  ||S(f ,  v)||  0  (4.2) 

with  vk(i)  £  0:  vT<Rv  ^  0  (4.3) 

&  flv  &L  with  vk(i)  54 0  s.t.  (Rv  =  0  (4.4) 

<?>  the  (k,  /)'*  column2  of  (R  is 

2  We  refer  to  the  ( k ,  <)th  row  (or  column)  of  a  matrix  of  the  dimension  of  (R 
when  we  vant  to  name  the  kth  row  (or  column)  within  the  Ah  block  in  vertical 
(horizontal)  direction.  This  notation  was  adopted  since  <R  is  block-Toeplitz. 
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x  linearly  independent  of  the  others  (4.5) 
<=* 3<f  s.t.  (Rd  =uk'!.  (4.6) 

Properties: 

i)  The  decorrelating  detector  for  each  bit  of  each  user  is  invariant 
with  respect  to  received  energies  and  does  not  require  knowledge 
thereof. 

Proof:  Since  the  elements  of  the  matrix  (ft  are  normalized 
crosscorrelation  coefficients,  the  defining  equation  (4.1)  is  energy 
independent. 

ii)  The  decorrelating  detector  eliminates  the  multiuser  interference 
present  in  the  respective  matched  filter  output.  (Hence  its  name). 

Proof:  From  (2.20)  the  decision  made  on  the  rth  bit  of  the  fah 
user  at  the  output  of  the  decorrelating  filter  d  is, 

£*0')=sgn(rfr(R'W6  +drn) 

=  sgn(v/  wk(i)bk(i )  +dTn).  (4.7) 


Interestingly,  this  natural  strategy,  though  not  necessarily  optimal  for 
specific  user-energies,  is  optimal  with  respect  to  the  worst  possible 
distribution  of  energies. 

iii)  The  fall-user  bit-error-rate  of  the  decorrelating  detector  is 
independent  of  the  energies  of  the  interfering  users  w,(:j,  j  ^  k, 
i  = 

Proof:  It  follows  from  (4.7)  that  the  decision  statistic  that  is 
compared  to  a  zero  threshold  is  independent  of  the  energies  of  the 
interfering  users. 


iv)  The  efficiency  of  the  decorrelating  detector  is  independent  of 


energies  and  is  given  by 

4,i  -  max2  | 

L  . 

0,  min 
|  sea 

**(<)=■  i 

1  (S{t,Wb),S{t,d))\ 

v/mo  m,d)\\  j 

■  (4.8) 

=  max2  | 

^0,  min 

I  sea 

ft*  (/>«*• 

l  vWoMOl 
vWo  \M(0  j 

(4.9) 

1 

“*(/)’ 

(4.10) 

which  by  i)  is  energy-independent. 

v)  The  decorrelating  detector  is  the  worst  case  optimal  linear  de¬ 
tector,  and  achieves  the  near-far  resistance  of  optimum  multiuser 
detection. 

Proof:  The.  proof  of  Proposition  1  is  constructive,  hence  the 
first  part  of  v)  was  obtained  as  a  byproduct  in  Appendix  1 .  Here  is 
a  shorter  proof,  using  the  following  fact.  Any  single  linear  strategy 
which  is  not  decorrelating  has  a  near-far  resistance  of  zero.  This  ic 
shown  as  follows.  The  near-far  resistance  of  a  linear  filter  is  (cf. 
(3.18)): 


{n(0} 

Fig.  5.  Equivalent  communication  system. 


and  outlines  an  alternative  proof  for  Proposition  1 :  we  could  have 
explicitly  solved  the  above  optimization  problem  by  proceeding  along 
the  same  lines  as  in  Appendix  1 ,  postulated  the  decorrelating  detector 
by  reasoning  as  in  F%tct  under  v),  and  shown  that  the  asymptotic 
efficiency  of  the  decorrelating  detector  and  the  near-far  resistance  of 
optimal  multiuser  detection  are  equal  (see  [7]).  However,  the  game 
theoretic  proof  provides  more  insight  into  the  nature  of  the  solution. 

Property  iii)  is  of  special  importance.  By  this  property  the  decor¬ 
relating  detector  does  not  become  multiple-access  limited,  no  matter 
how  strong  the  multiple-access  interference  is.  Also  the  decorrelating 
detector  demodulates  the  data  perfectly  in  the  absence  of  noise,  as 
can  be  seen  from  (4.7). 

Characterization:  We  would  now  like  to  find  an  explicit  expres¬ 
sion  for  the  decorrelating  detector  which  we  have  up  to  now  defined 
implicitly.  It  follows  immediately  from  (4.1)  and  the  uniqueness  of 
the  inverse  of  an  invertible  matrix  that  the  decorrelating  detector  for 
the  rth  bit  of  user  k  is  the  (£,  ;)th  row  of  the  inverse  of  (ft. 

From  the  above  and  (4. 10)  the  asymptotic  efficiency  of  the  decor¬ 
relating  detector  for  the  rth  bit  of  user  k  is  given  by  the  ( k ,  t)th 
diagonal  element  of  the  inverse  of  (ft: 

4,t  =  -r=r - •  (4-13) 

(Ar./) 

For  the  values  of  N  encountered  in  practical  applications,  inverting  a 
NK*NK  matrix  is  not  possible.  This  issue  is  addressed  in  Section 
IV-B  where  we  represent  the  decorrelating  detector  as  a  AT-input  K- 
output  time-varying  linear  filter,  and  then  show  that  in  the  limit  as 
N  tends  to  infinity  the  filter  becomes  time-invariant. 

B.  The  Limiting  Case  N  —  oo 

Proposition  2:  As  the  length  of  the  transmitted  sequence  in¬ 
creases  ( N  -»  oo)  the  decorrelating  detector  approaches  the  /(-input 
/C-output  linear  time-invariant  filter  with  transfer  function 

G(z)  *  fffr(l)z  +/?( 0)  +A(l)z-']"1.  (4.14) 

*  • 

Proof:  From  (2.14)  and  (2.13)  the  matched  filter  outputs  for 

/  =  {-M,--- ,M}  are 

y{l)  =Rr(lW(l  +  1  )b(l  +  1)  +R(0)H'(/)b(l) 

+R(lW(.t -l)b(! -l)+n(l)  (4.15) 


Vk,i(v)  =  max2  i  0,  inf  i  .  (4.11) 

\  M (?>-.  j 

Unless  (Rv  =uk,‘  (note  invariance  of  y  to  scaling  of  v)  the  value  of 
the  inf-term  is  -oo.  Hence  any  linear  filter  which  is  not  decorrelating 
has  a  near-far  resistance  ij  =0.  This  fact  together  with  the  nonzero 
asymptotic  efficiency  (4.10)  of  the  decorrelating  detector  establish 
optimality  of  the  decorrelating  detector  within  the  class  of  linear 
filters.  Therefore  the  second  part  of  v)  results  from  Proposition  1. 

Note  that  since  the  asymptotic  efficiency  of  the  decorrelating  de¬ 
tector  is  independent  of  energies  (Property  iv)  it  equals  the  near-far 
resistance.  This  gives  us  an  explicit  solution  for  the  Hilbert  space  op¬ 
timization  problem  we  obtained  for  the  near-far  resistance  of  optimal 
multiuser  detection  in  (3.7),  namely, 


hk.l  —  vtj 


1 

dkO) 


(4.  >.) 


where  b(-M  -  1)  =b(M  +  1)  =  0.  Taking  z-transforms  and  letting 
N  go  to  infinity  we  have 

Y(z)  =S(zWB](z)  +  N{z)  (4.16) 

where  ffVBKz)  is  the  ^-transform  of  the  sequence  wb  = 
{[  \/wi(i)b ,  (/),-•• ,  s/wK(i)b K  (/)]},  the  matrix  5(z)  is 

S(z)  =RT(l)z  +/?(0)  -i-/?(  1)z_i  (4.17) 

and  Y(z),  B(z)  and  N(z)  are,  respectively,  the  vector-valued  z- 
transforms  of  the  matched  filter  output  sequence,  the  transmitted 
sequence,  and  the  noise  sequence  at  the  output  of  the  matched  fil¬ 
ters.  S(z )  can  be  interpreted  as  the  equivalent  transfer  function  of 
the  multiuser  communication  system  between  transmitter  and  deci¬ 
sion  algorithm,  as  illustrated  in  Fig.  5.  In  this  setting  the  optimal 
receiver  problem  is  to  find  the  transfer  function  matrix  G(z)  of  a 
/'-input  /(-output  linear  time-invariant  filter,  at  the  output  of  which 
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{"  (')} 

Fig.  6.  Interpretation  of  the  decorrelating  detector. 


Hence,  for  a  zero  output  sequence yn  the  vector  H(eJu)X(eJu)  has 
to  vanish  for  all  w,  which  implies  that  //(e;")  is  singular  whenever 
X(e*u)  is  nonzero.  This  establishes  the  claimed  equivalence. 
Proposition  4:  Condition  (4.19)  of  Proposition  3  is  equivalent  to 


.min  (**/?( 0*  -  J(x'R+x)2  +  (x'R-x)2)  >  0  (4.20) 

X*X  =  1 

*ee 


a  sign-decision  yields  estimates  of  the  transmitted  sequence  which 
are  optimal  in  a  certain  sense.  In  our  case  the  optimality  criterion  is 
the  near-far  resistance,  and  we  have  demonstrated  that  the  optimal 
filter  is  the  decorrelating  fiber,  which  is  the  filter  that  eliminates  the 
multiuser  interference,  i.e.,  is  the  A'-input  AT-output  time  invariant 
linear  filter  which  recovers  the  transmitted  data  in  the  absence  of 
noise.  Its  transfer  function  is  therefore  the  inverse  of  the  equivalent 
transfer  function  S(z): 

C(z)  =  [S(z)r'.  (4-18) 

❖ 

The  effect  of  the  inverse  filter  [SXz)]-1  can  be  interpreted  as  illus¬ 
trated  in  Fig.  6.  The  decorrelating  filter  can  be  viewed  as  the  cascade 
of  a  finite  impulse  response  filter  with  transfer  function  adjoint  S(z), 
which  decorrelates  the  users,  but  introduces  intersymbol  interference 
among  the  previously  noninterfering  symbols  of  the  same  user,  and 
of  a  second  filter,  consisting  of  a  bank  of  K  identical  filters  with 
transfer  function  [det  .S(z)]-' ,  which  removes  this  intersymbol  inter¬ 
ference.  Whereas  the  region  of  convergence  of  the  z-transform  can 
always  be  chosen  so  as  to  make  S(z)  invertible,  attention  has  to  be 
paid  to  the  issue  of  stability. 

Proposition  3:  There  is  a  stable,  noncausal  realization  of  the 
decorrelating  detector,  if  and  only  if  the  signal  cross-correlations 
are  such  that 

detSfO  =  det[/?r(l)e'“  +R( 0)  +/?(l)e-'u]  £  0,  V«  €  (0,  2x]. 

(4.19) 

• 

Proof:  As  long  as  det  5(z)  has  no  zeros  on  the  unit  circle,  a 
nonempty  convergence  region  of5~‘(z)  can  be  chosen  which  in¬ 
cludes  the  unit  circle.  Thus,  stability  can  be  achieved.  But,  since 
/?( 0)  is  symmetric, 


where/?+  =  /?r(l)+/?(l)  and/?-  =y(/?r(  1)  — -i?(  1)).  The  *  denotes 
the  complex  conjugate.  • 

Note  that  both/?+  and/?-  are  Hermitian.  The  proof  of  Proposi¬ 
tion  4  is  given  in  [15],  together  with  the  following  two  results. 

—A  necessary  condition  for  (4.20)  is  that  the  matrices  /?( 0)  4- 
/?(1)  +RT(  1)  and/?(0)  -/?(1)  -/?r(i)  be  nonsingular. 

—  A  sufficient  condition  for  (4.20)  is  that 


>4in(K(0»  >  max  +)»  +)} 

The  following  results  quantify  the  asymptotic  efficiency  achieved  by 
the  limiting  decorrelating  detector. 

Proposition  5:  Let 

CO 

PUT1  =  D{m)z~m.  (4.21) 

mta— 00 

Then  the  asymptotic  efficiency  of  the  limiting  decorrelating  detector 
for  the  Mi  user  is  given  by 


„d  _ 

V  *  — 


1 

DU(  0) 


+Z?(0) +/?(l)e-'“]*7' rfw 


(4.22) 


(4.23) 


Proof:  From  Proposition  2  the  z-transform  of  the  decision 
statistic  at  the  output  of  the  limiting  decorrelating  detector  is  given 
by 


G(z)T(z)  =  [WB){z)  +  P(z)r'/V(z)  =  [WB](z)  +  N'(z) 


deti'(z)  =  detSr(z)  =  detSfz-1). 

Hence,  the  stable  version  of  the  decorrelating  detector  will  be  non¬ 
causal.  (As  a  side  remark,  the  matrix  S(eJa)  is  nonnegative  define 
for  all  oi,  cf.  [15]).  <)> 

Condition  (4.19)  is  equivalent  to  the  limit  of  the  LIA  as  N  —*  co. 
Both  are  necessary  and  sufficient  conditions  for  system  invertibility. 
The  LIA  requires  that  the  output  of  a  system  (the  system  between 
the  user  bit-streams  and  the  matched  filter  outputs)  not  be  identically 
zero  if  the  input  is  nonzero.  Hence  different  inputs  generate  differ¬ 
ent  outputs,  i.e.,  the  system  is  invertible.  For  a  linear  system  the 
requirement  that  nonzero  input  produce  nonzero  output  is  equivalent 
to  requiring  that  the  transfer  matrix  be  nonsingular  on  the  unit  circle. 
Assume  the  transfer  matrix  is  singular  at  the  angular  frequency  wo. 
Necessity  follows  since  otherwise  the  input  sequence  consisting  of 
a  complex  exponential  at  w0  times  a  vector  in  the  nullspace  of  the 
transfer  matrix  evaluated  at  wo  yields  zero  output;  since  the  trans¬ 
fer  function  on  the  unit  circle  gives  the  magnitude  and  phase  of  the 
system  response  to  complex  exponentials.  On  the  other  hand,  suf¬ 
ficiency  can  be  established  by  using  Parseval's  relation  extended  to 
multivariable  systems: 


where  N'(z)  is  the  z-transform  of  the  (stationary)  filtered  Gaussian 
background  noise  vector  sequence.  The  z-transform  of  its  covariance 
matrix  sequence  E]/i\  -)n/r(  ■  +/)]  is  equal  to  <r2[5(z)]-1,  hence 
with  (4.21)  n'k  is  a  zero-mean  Gaussian  random  variable  with  vari¬ 
ance  a2Dict( 0).  Therefore,  the  probability  of  error  for  the  Mi  user 
equals 


Pk  =  P(n'k  >  y/Wk)  =  Q 


'y/DkMj 


(4.24) 


From  here,  using  the  definition  of  asymptotic  efficiency,  the  first 
equality  follows.  For  the  second,  note  that  applying  the  inverse  z- 
transform  and  definition  (4.21),  we  obtain 


Dkk(0)  =  ~J  View'd «, 

and  the  result  follows  using  (4. 17).  <> 

Proposition  6:  The  asymptotic  efficiency  of  the  limiting  decorre¬ 
lating  detector  for  the  Mi  user  is  strictly  positive,  and  lower  bounded 

by 


=  -00 

=  ± \\H(eJu)X(en\\2  du. 


Vi  2 


max  |[/?7'(l)e'“  +/?(0)  4-R(\)e~n^  I 

u€|0,2x|  1  ’ 


>0. 

(4.25) 
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Proof:  From  (4.22),  (4.23) 


Dkk(0)<  max  |[Rr(i)e'u  +J?(0)  +/?(l)c-y"]^' I .  (4.26) 


Hence, 


’’‘‘“Km-  [„aK„ I**''0*''" +S(0)  +*<1)e'"i» I 

min  |det(Rr(l)e;“  +R( 0)  +/?(l)e~J,“]| 

>  _j; _  (4  ■ 

~  max  |adj*  {RT(l)eJa  +R( 0)  +fl(l)e"'“]|’ 


which  is  positive  by  Proposition  3.  <> 

Proposition  7:  In  the  two-user  case  let  i?l2(0)  =  pi2  and/?12(l)  = 
P21  •  Then  the  asymptotic  efficiency  of  the  decorrelating  detector  for 
infinite  sequence  length  is  given  by 

/n  _  A  _  „2  \2  _An2  „2 


vt  =m  =  vO  ~p\i  —  pit)2  —  4Pi2^it 

=  v/[l  -(P.2  +P2,)2][1  -(p,2 -AS.)2]-  (4.28) 


i 

\  1tain  -  'N\'’ 

i  ’!a - — 


- optimum  multiuser  detector 

- CONVENTIONAL  SINGLE-USER  DETECTOR 

- OECORRELATING  OETECTOR 


2mm  {|p,2 1  •|/>2l  1}  {  ''(k|+M) 

Mkl-U} 

Fig.  7.  Asymptotic  efficiencies  in  the  two-user  case  for  infinite  transmitted 
sequence  length,  when  the  user  energies  are  constant  over  time  (here  we 
chose  |pi2|,  |p2i|  =0.3,  0.5  which  yields  r)min  =0.68,  vf  =0.59) 


unforced  linear  dynamic  system 


1  -  Pi2  ~P2| 


*^7T  —  1  »  ^1  — 


1  “  P|2 


Proof:  This  formula  can  be  obtained  by  particularizing  Propo¬ 
sition  5  or  by  minimizing  the  asymptotic  efficiency  of  optimal  mul¬ 
tiuser  detection  in  the  two-user  case  with  respect  to  energies.  Alter¬ 
natively,  we  will  prove  (4.28)  by  taking  the  limit  as  N  -*■  oo  of  the 
i  asymptotic  efficiency  of  the  decorrelating  filter  for  the  central  bits  in 
[  a  length  N  sequence.  We  will  then  have  proved  that  in  the  two- 
user  case  the  limit  of  the  asymptotic  efficiency  of  the  finite-length 
decorrelating  detector  as  N  -*  oo  is  indeed  the  asymptotic  efficiency 
of  the  limiting  decorrelating  detector. 

Recall  that  the  asymptotic  efficiency  of  the  decorrelating  detector 
is  given  by  the  reciprocal  of  the  corresponding  diagonal  element  of 
<R_I.  We  need  to  find  explicit  expressions  for  the  central  diagonal 
I  elements  of  the  inverse  of  the  matrix  61  as  a  function  of  N.  We  have 

/  l  Pi2  0  0  \ 

P|2  i  P21  0 

(R  =  0  p21  1  P12  .  (4.29) 

1  0  0  pi2  1 


Denote  by  A„  the  determinant  of  the  above  n*n  matrix.  It  is  easy 
to  see  from  the  ft  •  ,.\e  of  61  that  A„  satisfies  the  recursion 


An  —  An—  - 


P?2  An -2,  neven 
P21  Art-2,  n  odd. 


Hence,  we  can  write 


1  —  P?  2  “P21 


The  eigenvalues  of  this  system  are  found  to  be 

n  1  —  P?2  —  P21  th  x/(  1  “  p]2  ~  P2i)2  —  4pj2P.:i 

A|  ,2  =  -  2  ' 

We  see  0  <  X|  <  X2  <  1.  After  finding  the  corresponding  eigenvec¬ 
tors  it  follows  that: 

Xi +Pi2  X24-p2(  X"  0 

Xn  ~  1  1  0  XJ 

ri  -(Xz+pi,)]  rn  1 


[-1  X,  4-pI,  J  [oj 

X|+P|2  X2  4-  p||  1  I"  X?  1  l 

=  7 — r-  (434) 

1  1  [— x?  x> 

Hence  the  central  diagonal  element  of  the  inverse  of  61  is 

A4„  [1  PE:,, 

A2„_i  A2rt  [0  l}Trt[l  0]*„ 

_ _ (X,/X2)^(X,  +p2|)-(X2  +p2|) _ 

-  ((X,/X2)n  -  1]KX,/X2)"(X,  +p1,)-(X2  +P5,)) 
•(X|  —  X2).  (4.35) 


A?rt  1  f  1  —  P12  — P21 1  f^Zn-Z  1 

=  .  (4.31) 

_  A2„_|  [  1  -P2IJ  [^27.-3 

|  If  we  consider  the  sequence  of  4n*4n  matrices  for  simplicity,  the 
|ccntral  diagonal  element  of  the  inverse  of  61  is  Aj„/( A2„-i  A2„). 
Hence,  after  introducing  the  state  vector 


So  finally 


iid  =  lim 


rt  —  OO  A2„-|  A 2 


=  X2  —  Xj 


/(1  ~  Pi:  -P2|)2  “  4P?zP§i 


a  we  see  that  finding  A2n,  A2fl-i  requires  finding  the  trajectory  of  the 


Fig.  7  shows  the  asymptotic  efficiency  of  the  decorrelating  detector 
for  infinite  transmitted  sequence  length  in  the  two  user  case.  Note  its 
invariance  with  respect  to  energies.  The  discrepancy  between  r)d  and 
jjmin.  defined  in  (3.4),  is  duo  to  the  fact  that  17  mm  is  higher  than  the 
near-far  resistance  of  optimum  multiuser  detection,  since  for  17  mm 
the  energies  are  constrained  to  be  constant  over  time. 
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R,2<1)  R,2(0) 


Fig.  8.  Signals  and  crosscorrelations  of  example  (4.42). 


The  fact  that  the  stable  version  of  the  decorrelating  filter  turns  out 
to  be  noncausal  is  not  surprising.  Due  to  the  lack  of  synchronism 
among  the  users  any  decision  based  on  less  than  the  entire  received 
waveform  is  suboptimal.  In  practice,  since  the  filter  is  stable,  the 
more  remote  symbols  will  count  less  heavily,  and  truncation  of  the 
noncausal  pan  will  be  performed  after  a  suitable  delay  without  affect¬ 
ing  performance  appreciably.  For  illustration  consider  the  two-user 
case  where  we  let  f?i2(0)  =  pt2  and  /?t2(l)  =  P21 .  Then 


S(Z)  = 


,  P12  +  P21Z 


P12  +  P21Z 
1 


-1 


and  the  transfer  function  of  the  decorrelating  detector  as  given  by 
(4.27)  is 


S"'(z)  = 


1 


1  ~  P12  ~  P21  —  P12P21Z  —  P12P21Z 


1  “(Pl2  +P2lZ-1)' 

— (Pl2  +P2lZ)  1 


(4.36) 


We  are  interested  in  the  impulse  response  f(rt)  of  the  IIR  part  of  the 
above  filter.  Taking  the  inverse  z-transform  it  is  found  to  be 


m=z-' 


1 


1  —  P|2  —  P21  “  PI2P21Z  —  PI2P21Z 


-1 


{l»l 

V 


(4.37) 


where  $  =  (1  -  pf2  -  p\\  -  i?)/(2pi2P2i)  and  ij  is  the  asymptotic 
efficiency  which  is  given  by  Proposition  7.  It  can  be  checked  that 
|S|  <  1.  with  equality  if  ]pi2|  +  IP21I  =  1.  which  can  be  shown  to 
coincide  with  the  condition  imposed  by  Proposition  3  for  the  two- 
user  case.  In  the  latter  case  the  asymptotic  efficiency  is  zero,  which 
follows  from  Proposition  7.  Otherwise,  since  j$|  <  1  the  limiting  fil¬ 
ter  is  stable,  with  symmetric  coefficients  which  decay  with  rate  $. 
In  practical  applications  the  filter  will  be  approximated  up  to  any 
desired  precision  by  truncation  of  the  noncausal  part  to  a  finite  num¬ 
ber  of  filter  coefficients.  For  illustration  the  decay  rate  £  of  the  filter 
coefficients  and  the  achievable  asymptotic  efficiency  ij  are  plotted  in 
Fig.  9  as  functions  of  P12  and  P21 . 

Poor  cross-correlation  properties  among  the  signature  waveforms 
vould  imply  that  the  limiting  filter  G(zj  does  not  exist,  although  the 
decurrelatir.g  detector  exists  for  finite-length  transmitted  sequences. 
We  give  an  example  to  illustrate  this  fact.  For  K  =  2  it  is  straight¬ 
forward  to  show  that  the  condition  of  Proposition  3  is  satisfied  for 
all  signal  constellations  for  which  If?i2(0)|  =  |/J|2(l)j  £  1.  This  is 
the  case  unless  the  normalized  waveforms  coincide  modulo  circular 
shifts  and  sign  changes. 

Consider  the  trivial  signal  case  where  both  users  are  assigned  the 
same  rectangular  waveform,  as  shown  in  Fig.  8.  Abbreviate  2(0), 
which  is  the  crosscorrelation  between  bits  in  the  same  signaling  in¬ 
terval,  by  r  —  7/T  €  (0,  1),  then  in  this  case  i?12(l),  which  is  the 


crosscorrelation  between  bits  in  adjacent  intervals,  is  1  -  r.  Then, 


S(Z)  = 


'  1 
r  +(1  -r)z 


r  +  (1  -r)z-1 
1 


(4.38) 


becomes  singular  for  z  =  1 ,  hence  there  is  no  stable  limiting  inverse 
filter.  And  if  it  existed  its  asymptotic  efficiency,  as  given  by  (4.28), 
would  be  zero.  This  is  not  surprising,  for  an  infinite  sequence  of 
transmitted  bits  where  both  users  use  the  same  waveform.  However, 
for  finite  length  sequences  advantage  can  be  taken  of  the  marginal 
effects  of  having  bits  which  are  not  affected  by  either  past  or  future 
bits.  For  finite  N  the  decorrelating  detector  exists  unless  r  =  0,  i.e. , 
when  the  transmissions  are  not  synchronous.  This  is  in  accord  with 
the  multiarrival  condition  given  in  Appendix  2,  and  with  the  results 
obtained  in  the  synchronous  case  [7]. 

V.  Error  Probabilities:  Numerical  Examples 

In  the  sequel,  we  compare  the  performances  of  the  conventional 
and  of  the  decorrelating  detector.  Without  loss  of  generality  we  con¬ 
sider  the  error  probability  of  user  1  in  a  channel  shared  by  several 
active  users.  The  conventional  detector  decides  for  the  sign  of  the 
kth  component  of  the  matched  filter  output  vector,  given  by  (2.14). 
Therefore  its  average  error  probability  over  the  bit  sequences  of  the 
interfering  users  equals 


1 


22(K  -  1) 


E 


M0>.M-I>,  JVI 

(  K  \ 

v/wT  -  £  Pi/W)W  +*I7(1)W-1>W 

7=2 


V 


,  (5.1) 


J 


whereas  its  worst  case  error  probability  over  the  interfering  bit  se¬ 
quences  equals 


/  *  \ 
;=2 


V 


(5.2) 


/ 


The  probability  of  error  of  the  decorrelating  detector  equals,  from 
(4.24), 


e(-fr= 

\ov/D„(  0) 

with  (the  equivalence  with  (4.23)  is  easy  to  show,  cf.  [15]) 

0)  =  -/  [l?(l)V“+if(0)+/?(l)e-;"lj;,r/a..  (5.3) 
T  Jo 

The  delays  enter  the  above  formulas  implicitly  via  the  crosscorrela¬ 
tion  matrices,  which  are  functions  thereof  and  of  the  chosen  signa 
ture  sequences.  In  the  following  examples,  we  have  chosen  a  set  of 
spread-spectrum  m- sequences  of  length  31. 

In  Fig.  10  we  use,  for  comparison  purposes  to  previous  works 
([14],  [1]),  the  set  of  3  sequences  reported  in  [12,  Table  V]  to  be 
optimal  with  respect  to  a  signal  to-multiple-access  interference  pa¬ 
rameter  when  the  conventional  detector  is  used.  We  consider  a  base¬ 
band  environment  with  K  - 1  active  equal  energy  interferes,  whose 
delay  relative  to  each  other  is  fixed.  Fig.  10,  for  K  -  3,  shows  the 
1st  user  error  probability  of  the  conventional  receiver  versus  SNRt, 
the  signal-to-background-noise  ratio  of  user  1,  for  different  values 
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Fig.  9.  (a)  Asymptotic  efficiency  of  the  decorrelating  detector  for  two  users 
as  a  function  of  the  partial  crosscorreiations  of  their  signature  waveforms, 
(b)  Decay  rates  of  the  coefficients  of  the  IIR  part  of  the  decorrelating 
detector  for  two  users,  symmetric  in  pa  and  P21 . 


Fig  10.  Error  probability  of  user  1  with  ?.  active  equal  energy  interferes, 
each  of  energy  Wj,  averaged  over  the  interfering  bit  sequences  and  over 
the  delay  of  user  1,  for  the  decorrelating  and  conventional  receiver  versus 
the  SNR  of  user  1,  for  /77-sequences  of  length  31  and.different  interference 
levels. 


of  the  energy  ratio  SNRj/SNR1 ,  averaged  over  the  bit  sequences  of 
the  two  interferes  and  over  the  delay  of  user  1 .  Also  shown  are  the 
1st  user  error  probability  of  the  decorrelating  detector  and  the  error 
probability  of  the  single  user  channel.  From  Fig.  10  we  see  the  strong 
dependence  of  the  performance  of  the  conventional  receiver  on  the 
relative  energies  of  the  active  users.  While  the  error  probability  of  the 
decorrelating  detector  is  invariant  to  the  energy  of  interfering  users, 
the  performance  of  the  conventional  receiver  deteriorates  rapidly  for 
increasing  interference,  till  for  an  energy  ratio  above  5  dB  the  con¬ 
ventional  receiver  becomes  practically  multiple-access  limited.  (Fora 
sufficiently  high  level  of  nonorthogonal  interference  the  error  proba¬ 
bility  of  the  conventional  receiver  can  be  seen  to  become  irreducible. 
E.g.,  in  the  two-user  synchronous  case,  for  V'vT/Vw’T  -  (1+  A)/p 
where  p  is  the  normalized  crosscorrelation  coefficient  between  the 
two  signature  signals  and  A  >  0,  the  error  probability  of  the  con¬ 
ventional  receiver  tends  to  1/4  if  A  =  0  and  to  1/2  if  A  >  0  for 
increasing  SNR  of  user  1).  Note  that  if  the  energies  of  all  the  users 
are  equal  the  decorrelating  detector  is  around  two  orders  of  mag¬ 
nitude  better  than  the  conventional  receiver  at  10  dB.  Only  if  the 
multiple-access  interference  level  plays  a  subordinate  role  compared 
to  the  background  noise  does  the  conventional  detector  outperform 
the  decorrelating  detector,  which  pays  a  penalty  for  combatting  the 
interference  instead  of  ignoring  it.  Similar  results  were  obtained  re¬ 
gardless  of  the  actual  value  of  the  relative  delay  between  the  two 
interfering  users. 

Fig.  1 1  shows  the  same  setting  as  above,  in  the  case  K  =  6.  We 
have  used  the  set  of  autooptimal  777-sequences  of  length  31  found  in 
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Fig.  11.  Same  as  Fig.  10,  with  5  active  equal  energy  interfcrcrs. 


Fig.  12.  Worst  case  probability  of  user  1  w.r.t.  the  bit  sequences  of  the 
interfering  users,  with  ?  equal  energy  interferes. 

[13,  Fig.  A.l]  to  be  optimal  with  respect  to  certain  peak  and  mean- 
square  correlation  parameters  which  play  an  important  role  in  the 
error  probability  analysis  of  the  conventional  detector.  Comparing 
Fig.  11  to  Fig.  10  we  see  the  same  qualitative  error  probability 
relation  between  the  two  detectors,  and  again  the  strong  near-far 
limitation  of  the  conventional  receiver.  Since  there  are  more  active 
interferers  the  performance  advantage  of  the  decorrelating  detector 
in  a  near-far  environment  is  even  more  pronounced:  if  the  energies 
of  all  the  users  are  equal  the  decorrelating  detector  is  almost  three 
orders  of  magnitude  better  than  the  conventional  receiver  at  10  dB, 

Finally,  Fig.  12  shows  the  worst  case  probability  of  the  conven¬ 
tional  detector  over  the  sequences  of  interfering  users,  as  given  by 
(5.2),  for  K  =  10.  The  signature  sequence  set  used,  for  K  =  6 
has  been  expanded— without  trying  to  optimize,  as  before,  with  re¬ 
spect  to  the  performance  of  the  conventional  detector.  The  shown 
error  probabilities  are  typical,  varying  very  little  if  different  sets  of 
delays  are  used  because  of  the  good  crosscorrelation  properties  of 
m-sequences. 

Overall  the  generated  error  probability  curves  show  the  pro¬ 
nounced  superiority  of  the  decorrelating  receiver  in  a  near-far  envi¬ 
ronment,  and  whenever  sufficiently  many  users  are  active  even  if  their 
energies  are  well  below  the  energy  of  the  desired  user.  Note,  finally, 
that  the  selected  signature  sequences  were  optimal  with  respect  to  the 
performance  of  the  conventional  receiver.  It  would  be  interesting  to 
investigate  the  possible  performance  gain  of  using  the  decorrelating 


detector  in  conjunction  with  a  set  of  signature  sequences  optimized 
for  its  use. 

VI.  Conclusions 

In  this  paper,  we  have  obtained  a  linear  multiuser  detector, 
the  decorrelating  detector,  for  demodulation  of  asynchronous  code¬ 
division  multiplexed  signals  in  white  Gaussian  channels.  The  bit¬ 
error-rate  of  this  detector  is  independent  of  the  energy  of  the  inter¬ 
fering  users  and  exhibits  the  same  degree  of  near-far  resistance  as 
the  optimum  multiuser  detector  obtained  in  [1].  Since  the  decorrelat¬ 
ing  detector  does  not  require  knowledge  of  the  received  energies  and 
its  complexity  is  only  linear  in  the  number  of  users,  it  emerges  as 
the  solution  of  choice  in  near-far  environments  with  a  large  number 
of  users. 

In  applications  where  each  receiver  is  interested  in  demodulating 
the  information  transmitted  by  only  one  user,  it  is  easy  to  decentralize 
ti,e  AT-uscr  decorrelating  receiver  since  it  can  be  implemented  as  K 
separate  (continuous-time)  single-input  (discrete-time)  single-output 
filters.  Each  of  those  filters  can  be  viewed  as  a  modification  of  the 
conventional  single-user  matched  filter  where  instead  of  correlating 
the  channel  output  with  the  signature  waveform  of  the  user  of  interest, 
we  use  its  projection  on  the  subspace  orthogonal  to  the  space  spanned 
by  the  interfering  signals. 

Note,  finally,  that  if  the  filter  is  actually  an  approximation  to  the 
decorrelating  receiver,  due  to,  for  example,  finite  accuracy  in  the 
computation  of  the  crosscorrelations  or  truncation  of  the  impulse 
response  (Section  IV-B),  it  will  no  longer  be  orthogonal  to  the  sub¬ 
space  of  the  interfering  signals  and  therefore  it  will  not  be  near-far 
resistant  in  the  worst  case  sense  adopted  in  this  paper.  However, 
the  effect  on  the  bit-error-rate  will  be  arbitrarily  small  with  a  good 
enough  approximation  to  the  decorrelating  receiver,  and  therefore  the 
bit-error-rate  will  be  very  insensitive  to  the  energy  level  of  the  inter¬ 
ferers.  Hence,  the  resistance  to  the  near-far  problem  can  be  preserved 
within  any  desired  energy  range. 

Appendix  1 

Saddle-Point  Property  in  (3.19) 

Though  the  penalty  (unction  of  (3.18)  looks  similar  to  the  signal- 
to-noise  ratio  fimctional  encountered  in  the  robust  matched  filtering 
problem  [5],  |{A, s)\2/{h,  £/i),  the  problem  is  different  here  be¬ 
cause  the  numerator  can  be  negative.  Thus  we  have  to  establish  the 
result  “from  scratch.”  In  order  to  show  that  M(y,y)  has  a  saddle 
point,  i.e,,  satisfies  (3.19),  we  show  that  it  satisfies  the  requirements 
ofrthe  following  theorem. 

Theorem  (4,  Thm.  2.1):  Suppose  Q  is  a  convex  set  and  M(v,  ■ ) 
is  convex  on  Q  for  every  v  g  H.  Then  if  (vj,,^^)  is  a  regular  pair* 
for  (H,  Q,  M),  the  following  are  equivalent: 

a)  yi  €  argmin  sup  M(v,.y), 

xSSve/r 

b)  (vi,yL)  is  a  saddle  point  solution  for  (H,  Q,  Nf). 

This  theorem  establishes  that  if  we  exhibit  a  regular  pair  whose  sec¬ 
ond  argument  satisfies  a),  the  game  ( H ,  Q,  M)  has  a  saddle  point, 
which  means  that  the  sequence  of  max  and  min  in  (3.18)  can  be  in¬ 
terchanged.  In  the  following,  we  find  a  suitable  regular  pair,  thereby 
proving  (3.19). 

Clearly  the  convexity  conditions  are  satisfied  (the  set  of  detectors  is 
not  required  to  be  topologized).  We  need  to  find  a  candidate  regular 
pair.  Note  that  the  value  of  inf  term  in  (3.18)  is  -  cc  (which  gives  a 
near-far  resistance  of  zero)  unless  v  is  picked  such  that  (Rv  =zuk,‘  (77 
is  inva.iant  with  respect  to  scaling  of  v).uk-‘  is  the  (Ar,  /)th  unit  vector 
in  the  Hilbert  space  L,  defined  as  Uj'(!)  =  hfiii-  This  gives  us  a 
candidate  for  an  optimal  detector  vi.:  d,  with  <Rd  =  «*•'.  Existence 
of  such  a  vector  is  shown  in  (4.6)  to  follow  from  the  LIA  of  (2.3). 

*(V[.,yL)  eh  x  Q  is  a  regular  pair  for  (H,  Q,  M)  if,  for  every y  e  Q 
such  thatya  =  ( 1  -  a]yL  ~ ay  e  Q  for  a  €  (0,  1),  we  nave 

sup  M(v,ya )  - M(vL,ya)  =  o(a). 

rSH 
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(If  tliis  detector  is  indeed  optimal,  which  follows  if  the  candidate  pair 
is  regular  and  satisfies  a),  and  coincides  with  v*). 

Next  we  find  a  yL  which  meets  the  requirement  of  point  a)  of  the 
theorem.  Using  the  Cauchy-Schwarz  inequality,  we  find  that 

supAf(v,>’)=  sup  y-=  =  =  JyT<&y  (A,l) 

v<=«  vSL  VWtRp 

where  the  inner  product  is  maximized  for  v  =  Ay  -f  {x  6  L:  (Rx  = 

0}. 

We  now  need  to  solve  the  Hilbert  space  optimization  problem 

infyr(Ry  (A.2) 

subject  to  yk(i)  =  1. 

Using  (2.21)  and  the  definition  of  d  we  can  rewrite  the  minimization 
problem  under  consideration  as 

inf  |ly||R  (A.3) 

subject  to  (flty)R  =  1. 


y&v/vVtRv  has  a  saddle  point,  i.e., 


sup  inf 
vgi  y$L 
vr<Rvyi  >*<*)"< 


j>r(Rv 

Vvr(Rv 


inf  inf 

y£L 

ytO)<* i  vr<sv'^« 


yr<Rv 

s/wr(Rv 


(A.7) 


Appendix  2 


❖ 


Sufficient  Conditions  for  Linear  Independence 

Suppose  that,  for  a  fixed  signal  set, 
i)  {n  tk  }  are  continuous  random  variables, 
it)  {ti  ,  ■  •  ■ ,  tk  }  are  independent  random  variables, 
iii)  wk(i)  5^  0. 

Then  almost  surely  there  is  no  v  SZ,,u*(/)  ^0  such  that  S(/,v)  =0. 

Proof:  Define  the  times  of  effective  arrival  and  departure  of  the 
rth  signal  of  the  &th  user  [1],  as 


X“*  =  Tk  +iT  +  sup  |r  €  [0,  T),  J  s\(t)dt  =  0  j  (A.8) 
X^=r*+iT  +  inf  |r€(0,r],  jf  s|(/)c(/=o|,  (A.9) 


ll-IU  is  a  norm  since  <R  is  positive  definite.  We  have  obtained  a 
minimum-norm  optimization  problem  in  Hilbert  space.  To  prove  ex¬ 
istence  of  a  solution-we  need  to  show  that  constraint  set  to  be  closed, 
which  holds  since  the  Hilbert  space  is  finite  dimensional.  (Even  for 
N  — *  co,  when  we  have  an  infinite  dimensional  optimization  prob¬ 
lem,  we  could  use  the  fact  that  the  codimension  is  finite.  The  prob¬ 
lem  there  is  that  the  signals  are  no  longer  square  integrate.;  The 
constraint, yk(i)  =  1,  is  equivalent  toy  -uk-‘  +  {r:  p,d)R  =0}. 
&  =  [</],  the  subspace  generated  by  d,  is  a  closed  subspace  of  dimen¬ 
sion  1.  Hence  the  constraint  set  {x;  d):i  =  0}  =  Ctx  is  closed. 
We  now  have  a  minimum-norm  optimization  problem  in  Hilbert  space 
over  a  closed  subspace.  Hence,  the  Projection  Theorem,  [6],  guar¬ 
antees  existence  [so  we  can  replace  the  inf  by  a  min,  as  required  in 
a)]  and  uniqueness  of  a  minimizing  equivalence  classy*,  with 

y*  €  {ax  +!/*•' }naxx  =  {ax  +«*■'} not  (A.4) 


where  equality  holds  since  A  is  closed.  Hence  y"’1  =  1  andy  *  = 
kd,  which  implies 


ym 


dk0)d' 


(A.5) 


We  now  have  a  candidate  regular  pair  which  satisfies  a):  (vi,yL)  = 
(d,  (dk0))~ld).  From  (A.l)  and  the  definition  of  regularity  we  have 
to  check  the  dependence  on  a  of 


\fyTa<^ya  - 


yT„&vi. 

\fvTL6lvL 


=  y/dr<M+2a(y  -d)T<M  +a2(y  -d)r(R(y  -d) 


V  dk(i) 

-\fzO> <a-s> 

We  have  repeatedly  used  the  decorrelating  property  of  d  Since 
f\,+  x  <  1  + 1  !2x.  the  above  quantity  lies  in  the  interval  (0,  (y  - 
d)T  ft(y  -d)Jdk(i)/2a  ],  hence  divided  by  a  goes  to  0  when  a  10. 
Thus  ( d ,  (dk{i))~'d)  is  a  regular  pair  which  satisfies  point  a)  of  the 
theorem.  Hence  it  follows  from  the  theorem  that  the  penalty  function 


respectively. 

Since  u*(/)  f  0  there  is  a  first  and  a  last  symbol  that  differs  from 
zero.  It  is  readily  apparent  that  in  order  to  have  S(t,  v)  =  0,  the 
effective  arrival  of  the  first  (and  the  effective  departure  of  the  last) 
symbol  that  differs  from  zero  must  be  a  point  of  effective  multiar 
rival  (respectively  multideparture).  Note  that  this  property  does  not 
depend  on  the  particular  v  chosen,  but  only  on  the  set  of  delays. 
From  (A.8),  (A.9),  the  effective  times  of  arrival  and  departure  in¬ 
herit  from  the  delays  the  properties  of  being  continuously  valued  and 
mutually  independent.  Therefore,  the  result  follows,  since  the  set  of 
delays  {r, ,  ,  tk  }  for  which  multiarrival  points  result  has  measure 

zero.  <> 
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Abstract  —In  code-division  multiple-access  systems,  simultaneous  mul¬ 
tiuser  accessing  of  a  common  channel  is  made  possible  by  assigning  a 
signature  waveform  to  each  user.  Knowledge  of  these  waveforms  enables 
the  receiver  to  demodulate  te  data  streams  of  each  user,  upon  observation 
of  the  sum  of  the  transmitted  signals,  perturbed  by  additive  noise.  Under 
the  assumptions  of  symbol-synchronous  transmissions  and  white  Gaussian 
noise,  we  analyze  the  detection  mechanism  at  the  receiver,  comparing 
different  detectors  by  their  bit  error  rate  in  the  low  background  noise 
region,  and  by  their  worst-case  behavior  in  a  near-far  environment  where 
the  received  energies  of  the  users  are  not  necessarily  similar.  Optimum 
multiuser  detection  achieves  important  performance  gains  over  conven¬ 
tional  single-user  detection  at  the  expense  of  computational  complexity 
that  grows  exponentially  with  the  number  of  users.  It  is  shown  that  in  the 
synchronous  case  the  performance  achieved  by  linear  multiuser  detectors 
(whose  complexity  per  demodulated  bit  is  only  linear  in  the  number  of 
users)  is  similar  to  that  of  optimum  multiuser  detection.  Attention  is 
focused  on  detectors  whose  linear  memoryless  transformation  is  a  general¬ 
ized  inverse  of  the  matrix  of  signature  waveform  crosscorrelations,  and  on 
the  optimum  linear  detector.  It  is  shown  that  the  generalized  inverse 
detectors  exhibit  the  same  degree  of  near-far  resistance  as  the  optimum 
multiuser  detector;  the  optimum  linear  detector  is  obtained  subsequently, 
along  with  sufficient  conditions  on  the  signal  energies  and  crosscorrela¬ 
tions  to  ensure  that  its  performance  is  equal  to  that  of  the  optimum 
multiuser  detector. 

I.  Introduction 

ODE-DIVISION  multiple-access  is  a  multiplexing 
technique  where  several  independent  users  access  si¬ 
multaneously  a  multipoint-to-point  channel  by  modulating 
preassigned  signature  waveforms.  These  waveforms  are 
known  to  the  receiver,  which  observes  the  sum  of  the 
modulated  signals  embedded  in  additive  white  Gaussian 
noise.  If  the  assigned  signals  were  orthogonal,  then  a  bank 
of  decoupled  single-user  detectors  (matched  filters  fol¬ 
lowed  by  thresholds)  would  achieve  optimum  demodula¬ 
tion.  In  practice,  however,  orthogonal  signal  constellations 
are  more  the  exception  than  the  rule  because  of  bandwidth 
or  complexity  limitations  (the  number  of  potential  users 
can  be  very  large),  lack  of  synchronism,  or  other  design 
constraints.  Therefore  the  question  of  interest  is  how  to 
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demodulate  the  transmitted  messages  when  the  assigned 
signals  are  not  orthogonal.  In  practice,  demodulation 
strategies  have  been  restricted  to  single-user  detection, 
thereby  placing  the  whole  burden  of  complexity  on  the 
cross  correlation  properties  of  the  signal  constellation. 
Recently,  the  optimum  multiuser  detector  for  general  asyn¬ 
chronous  Gaussian  channels  was  derived  and  analyzed  in 
[1].  The  optimum  detector  significantly  outperforms  the 
conventional  single-user  detector  at  the  expense  of  a 
marked  increase  in  computational  complexity— it  grows 
exponentially  with  the  number  of  users. 

The  purpose  of  this  paper  is  to  investigate  new  low-com¬ 
plexity  multiuser  detection  strategies  that  approach  the 
performance  of  the  optimum  detector  and  to  gain  further 
insight  into  the  performance  of  the  optimum  multiuser 
detector.  Our  attention  in  focused  on  symbol-synchronous 
channels,  where  the  symbol  epochs  of  all  users  coincide  at 
the  receiver.  Although  in  practice  this  assumption  rules  out 
the  important  class  of  completely  asynchronous  code-divi¬ 
sion  multiple-access  systems,  it  holds  in  slotted  channels, 
and  its  study  is  a  necessary  prerequisite  for  tackling  the 
general  asynchronous  channel  by  allowing  us  to  gain  some 
appreciation  of  the  main  issues  in  the  simplest  possible 
setting. 

The  performance  measure  of  interest  is  the.  probability 
of  error  of  each  user.  In  multiuser  problems  it  is  often 
more  convenient  and  intuitively  sound  to  give  information 
concerning  the  error  probability  by  means  of  the  efficiency, 
or  ratio  between  the  effective  signal-to-noise  ratio  (SNR) 
and  the  actual  SNR,  where  the  effective  SNR  is  the  one 
required  to  achieve  the  same  probability  of  error  in  the 
absence  of  interfering  users,  and  the  actual  SNR  is  the 
received  energy  of  the  user  divided  by  the  power  spectral 
density  level  of  the  background  thermal  white  Gaussian 
noise  (not  including  interference  from  other  users).  Note 
that  since  the  single-user  error  probability  is  a  one-to-one 
function  of  the  SNR,  the  efficiency  gives  the  same  infor¬ 
mation  as  the  error  probability.  Its  limit  as  the  background 
Gaussian  noise  level  goes  to  zero,  the  asymptotic  efficiency, 
characterizes  the  underlying  performance  loss  when  the 
dominant  impairment  is  the  existence  of  other  users  rather 
than  the  additive  channel  noise.  Denoting  the  power  spec¬ 
tral  density  level  of  the  background  white  noise  by  o2,  the 
kth  user  asymptotic  efficiency  of  a  detector  whose  k  th 
user  error  probability  and  energy  are  equal  to  Pk  and  wk. 
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respectively,  can  be  written  as  [l]1 

O^r^l;  \im^Pk(o)/Q^^-— +oo|, 

(1*1) 

i.e.,  the  logarithm  of  the  error  probability  goes  to  zero  with 
the  same  slope  as  the  single-user  bit  error  rate  with  energy 
rikwk.  In  this  paper  we  compare  the  performance  of  the 
various  multiuser  detectors  by  means  of  the  asymptotic 
efficiency.  In  the  high2  SNR  region,  the  advantage  of  this 
measure  over  the  probability  of  error  is  twofold,  it  quanti¬ 
fies  the  performance  degradation  due  to  the  existence  of 
other  users  in  a  simple,  intuitive  way,  and  in  contrast  to 
multiuser  error  probability  for  which  only  (asymptotically 
tight)  upper  and  lower  bounds  are  known  [1],  exact  expres¬ 
sions  for  the  asymptotic  efficiency  are  feasible. 

The  main  shortcoming  of  currently  operational  networks 
employing  code-division  multiple-access  is  the  near -far 
problem.  This  refers  to  the  situation  wherein  the  received 
powers  of  the  users  are  dissimilar  (e.g.,  in  mobile  radio 
networks).  Since  the  output  of  the  matched  filter  of  each 
user  contains  a  spurious  component  which  is  linear  in  the 
amplitude  of  each  of  the  interfering  users,  the  error  proba¬ 
bility  increases  to  1/2  as  the  multiuser  interference  grows, 
the  asymptotic  efficiency  becomes  zero,  and  the  conven¬ 
tional  single-user  detector  is  unable  to  recover  reliably  the 
messages  transmitted  by  the  weaker  users  even  if  signals 
with  very  low  crosscorrelations  are  assigned  to  the  users. 
However,  the  near-far  problem  is  not  an  inherent  charac¬ 
teristic  of  code-division  multiple-access  systems.  Rather,  it 
is  the  inability  of  the  conventional  single-user  receiver  to 
exploit  the  structure  of  the  multiple-access  interference 
that  accounts  for  the  ubiquity  of  the  near-far  problem  in 
practice.  We  show  that  the  optimum  multiuser  detector 
and  other  multiuser  detectors  with  much  lower  computa¬ 
tional  complexity  are  near  -  far  resistant  under  mild  condi¬ 
tions  on  the  signal  constellation.  By  near-far  resistance  we 
mean  the  asymptotic  efficiency  minimized  over  the  ener¬ 
gies  of  all  the  interfering  users.  If  this  minimum  is  nonzero, 
and,  as  a  consequence,  the  performance  level  is  guaranteed 
no  matter  how  powerful  the  multiuser  interference,  then 
we  say  that  the  detector  is  near-far  resistant. 

The  organization  of  the  rest  of  the  paper  is  as  follows. 
The  asymptotic  efficiency  and  the  near-far  resistance  of 
both  the  conventional  and  the  optimum  detectors  are  given 
in  Section  II.  In  Section  III,  we  introduce  the  decollating 
multiuser  detector.  This  detector  linearly  transforms  each 
vector  of  matched  filter  outputs  with  a  generalized  inverse 
of  the  signal  crosscorrelation  matrix.  It  is  shown  that, 
somewhat  unexpectedly,  the  near- far  resistance  of  the 
optimum  multiuser  detector  coincides  with  that  of  the 
decorrelating  detector  whose  complexity  per  demodulated 


xQ(x)~fr(W^)e'vl/2dv. 

5  In  the  numerical  results  of  (1]  and  [2],  the  efficiency  is  indistinguish¬ 
able  from  the  asymptotic  efficiency  for  SNR’s  higher  than  7  dB. 


bit  is  only  linear  in  the  number  of  users.  Finally,  Section 
IV  investigates  the  performance  of  the  optimum  linear 
transformation  and  gives  sufficient  conditions  on  the  sig¬ 
nal  energies  and  crosscorrelations  to  ensure  that  the 
asymptotic  efficiency  of  the  optimum  linear  transforma¬ 
tion  is  equal  to  that  of  the  optimum  multiuser  detector. 

II.  Single-User  Detection  and  Optimum 
Multiuser  Detection 

Suppose  that  the  k  th  user  is  assigned  a  finite  energy 
signature  waveform,  {^(r),  re[0, 7’]},  and  that  it  trans¬ 
mits  a  string  of  bits  by  modulating  that  waveform  antipo- 
dally.  If  the  users  maintain  symbol  synchronization  and 
share  a  white  Gaussian  multiple-access  channel,  then  the 
receiver  observes 

K0=  £  bk{j)sk(t-  jT)  +  on{t), 

k- 1 

te[jTJT+T]  (2.1) 

where  n(t)  is  a  realization  of  a  unit  spectral  density  white 
Gaussian  process  and  [bk(j)£  {-l,l))y  is  the  Ath  user 
information  sequence.  Assuming  that  all  possible  informa¬ 
tion  sequences  are  equally  likely,  it  suffices  to  restrict 
attention  to  a  specific  symbol  interval  in  (2.1),  e.g.,  j  =  0. 

It  is  easy  to  check  that  the  likelihood  function  depends 
on  the  observations  only  through  the  outputs  of  a  bank  of 
matched  filters: 

yk  =  fTr(t)sk(t)dt,  (2.2) 

■'o 

and  therefore  y  =  (yt,- •  ■,  yK)  are  sufficient  statistics  for 
demodulating  b  -  (£,,•  •  •,  bK).  We  investigate  ways  of 
processing  these  sufficient  statistics,  which  according  to 
(2.1)  and  (2.2)  depend  on  the  transmitted  bits  in  the 
following  way: 

.  y  =  Hb  +  n  (2.3) 

where  H  is  the  nonnegative  definite  matrix  of  crosscorrela¬ 
tions  between  the  resigned  waveforms: 

H„  =  f^sMs^t)  dt  (2.4) 

and  its  diagonal  entries  are  the  energies-per-bit,  H„  =  wt  > 
0,  of  each  user;  and  n  is  a  zero-mean  Gaussian  A'-vector 
with  covariance  matrix  equal  to  alH. 

Conventional  single-user  detection  is  the  simplest  way  to 
make  decisions  based  on  yk,  demodulation  is  decoupled 
and  the  multiuser  interference  is  ignored,  yield  ug  the 
following  decisions  for  the  k  th  user: 

bck  =  s&iyk. 

On  the  other  hand,  the  optimum  multiuser^  detector 
selects  the  most  likely  hypothesis  b*  =  (b*,  , bk)  given 

the  observations,  which  corresponds  to  selecting  the  noise 
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realization  with  minimum  energy,  i.e., 


b*  s  arg  min 

be  {-l.i)*  •'0 


r(t)~  Z  M*( 0 

A-l 


(it 


=  arg  max  2yTb-  brHb. 

Ae{-t.i)* 


(2.5) 


The  computational  complexities  of  the  single-user  detector 
and  the  optimum  multiuser  detector  are  radically  different. 
While  the  time-complexity  per  bit  (TCB)  of  the  single-user 
detector  is  independent  of  the  number  of  users,  no  algo¬ 
rithm  that  solves  (2.5)  in  polynomial  time  in  K  is  known. 
The  reason  for  this  is  the  nondeterministic  polynomial 
(NP)-completeness  of  optimum  multiuser  detection  (Ap¬ 
pendix  I). 

The  performances  of  the  detectors  are  also  quite  differ¬ 
ent.  It  is  straightforward  to  find  the  Ath  user  probability 
of  error  of  the  conventional  single-user  detector: 


PZ=P[yk>o\bk  =  -i] 

-  I  P[yk>^y[t>\bk  =  ->} 


be  (  —  1,1}* 
h„--l 


(wk~  Z  b;Hik  \ 


=  2»‘*  Z  Q 


Ae(-l,ij* 


i*k 


(2.6) 


In  the  low  background  noise  region,  the  foregoing  sum¬ 
mation  is  dominated  by  the  term  corresponding  to  the 
least  favorable  bits  of  the  interfering  users,  i.e.,  b,= 
sgn (Hlk)  Thus  the  asymptotic  efficiency  of  the  conven¬ 
tional  detector  is  equal  to 


ffk  =  sup  ( 0  r  <;  1 ;  lim  Pk/Q 

1  o-O 


<  +oo , 


=  max2 


L 


wk  I 


=  max2 


I 


(2.7) 


where  R  is  the  matrix  of  normalized  (unit-energy)  cross 
correla  ions,  i.e., 

H=*WX/2RWX/2  (2.8) 

where  W  =  diag{  Wj,-  •  -  ,wK}.  It  follows  from  (2.7)  that  the 
conventional  Ath  user  detector  is  near-far  resistant  (i.e., 
its  asymptotic  efficiency  is  Pounded  away  from  zero  as  a 
function  of  the  interfering  users’  energies)  only  if  Rlk  -  0 
for  all  /  *  k,  i.e.,  only  if  the  Arth  user’s  signal  is  orthogonal 
to  the  subspace  spanned  by  the  other  signals.  Otherwise, 

5£=  inf  ^  =  0.  (2.9) 

i  +  k 

The  Ath  user  error  probability  of  the  optimum  multiuser 
receiver  is  asymptotically  (as  a  -» 0)  equivalent  to  that  of  a 
binary  test  between  the  two  closest  hypotheses  that  differ 
in  the  Ath  bit  (see  [1]).  The  square  of  the  Euclidean 


distance  between  the  signals  corresponding  to  these  two 
hypotheses  is  equal  to 


mm  mm 

be  (  —  1.1)*  de{- 1,1}* 


Z  b,Si(t)-  Z  disM 
1-1  1-1 


2  min  er//c.  (2.10) 
<€{-1.0.1}* 

<*“t 


Hence  the  asymptotic  efficiency  of  the  optimum  mul¬ 
tiuser  detector  is  equal  to 

t n~ —  min  c  THt.  (2.11) 

Wk  <€{-1.0,1}* 

<*-t 

This  is  the  highest  efficiency  attainable  by  any  detector 
becaur  as  or  -» 0  the  optimum  multiuser  detector  achieves 
minimum  probability  of  error  for  each  user.  In  the  two-user 
case,  denoting  P  —  Al2>  (2.11)  reduces  to 


ijl  =  min 


w. 


1,1  +  —  -2|p| 


w. 


(2.12) 


and- similarly  for  user  2.  Unfortunately,  no  explicit  expres¬ 
sions  are  known  for  (2.11)  in  general.  In  fact,  the  combina¬ 
torial  optimization  problem  in  (2.11)  is  also  NP-complete 
(Appendix  I). 

Nevertheless,  it  is  indeed  possible  to  obtain  a  closed-form 
expression  for  the  near-far  resistance  of  the  optimum 
multiuser  detector,  because  the  minimization  of  the  asymp¬ 
totic  efficiencies  with  respect  to  the  energies  of  the  interfer¬ 
ing  user  waveforms  reduces  the  combinatorial  optimization 
problem  in  (2.11)  to  a  continuous  optimization  problem 
whose  solution  is  given  by  the  following  result. 

Proposition  1:  Denote  by  R+  the  Moore- Penrose  gener¬ 
alized  inverse3  of  the  normalized  crosscorrelation  matrix 
R.  If  the  signal  of  the  Ath  user  is  linearly  independent,  i.e., 
it  does  not  belong  to  the  subspace  spanned  by  the  other 
signals,  then 


(2.13) 


Otherwise,  r\k  =  0. 


Proof:  Using  (2.11)  for  the  maximum  asymptotic  effi¬ 
ciency  of  the  /c  th  user,  we  obtain 

1  r 

T)k  =  mm  nun  — iHt 

*,2.0  <e  {-1,0,1}*  Wk 
i  +  k  <*- 1 

=  min  min  — tTWx/2RWx/2t 

"iZO  <e  {-1,0,1}*  wk 
i  +  k  <k-i 

=  min  x  rRx 

xe  R* 

=  rain  .(1  +  2 zrok  +  zTRkz)  (2.14) 

i  e  ft*-1 


3A  generalized  inverse  A  of  a  matrix  B  is  any  matrix  that  satisfies  1. 
ABA-  A  and  2.  BAB  -  B.  The  Moo  re-Penrose  generalized  inverse  is  the 
unique  generalized  inverse  that  satisfies  3.  AB  and  BA  are  Hermitian. 
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where  Rk  is  obtained  from  R  by  deleting  the  A:  th  row  and 
column  and  ak  is  the  k  th  column  of  R  with  the  A:  th  entry 
removed.  Henceforth,  we  denote  such  a  partitioning  of  a 
symmetric  matrix  with  respect  to  the  A:  th  row  and  column 
by  /?  =  [/?*,  a*,  1],  where  the  rightmost  element  in  the 
square  brackets  is  the  kth  diagonal  entry.  The  minimum  in 
the  right  side  of  (2.14)  is  achieved  by  any  element  z*  such 
that 

RkZ*  =  ~ak.  (2.15) 

Because  of  the  Fredholm  theorem  (11,  p.  115],  the  solv¬ 
ability  of  (2.15)  is  equivalent  to  a*  being  orthogonal  to  the 
null  space  of  Rk.  However,  for  all  z  e  RK~l  the  parabola 
q{v)  =  v2  +  2ozrak  +  zrRkz  has  at  most  one  zero  because 
it  is  equal  to  the  quadratic  form  of  the  nonnegative  defi¬ 
nite  matrix  R  with  a  vector  whose  &th  coordinate  is  v  and 
whose  other  components  are  equal  to  z.  Therefore,  the 
discriminant  of  the  parabola  satisfies  (zTa  k)2  -  zrR  kz  ^  0; 
in  particular,  if  z  belongs  to  the  null  space  of  Rk,  then 
zTak  =  0.  So  ak  is  indeed  orthogonal  to  the  null  space  of 
Rk.  Substituting  (2.15)  into  (2.14)  we  obtain 

r\k  =  \-Z*TRkZ* 

=  1  -z*TRkRtRkz* 

=  1  -aTkR+kak.  (2.16) 

Notice  that  the  k  th  user  is  linearly  dependent  if  and 
only  if  there  exists  a  linear  combination  of  the  columns  of 
R  that  includes  the  Ac  th  column  and  is  equal  to  the  zero 
vector.  Therefore,  if  a  user  is  linearly  dependent  then  we 
can  find  x  such  that  Rx  =  0  and  xk  =1,  in  which  case  the 
penultimate  equation  in  (2.14)  indicates  that  rj*  =  0.  To 
obtain  the  near-far  resistance  of  a  linearly  independent 
user,  we  employ  the  following  property,  which  is  invoked 
again  later  on. 

Lemma  1:  If  the  k  th  user  is  linearly  independent,  then 
every  generalized  inverse  Rr  of  R  satisfies:  (R!R) kj 
( RR')jk  =  SJk  and  R'kk  =  Rkk.  (Analogous  formulas  hold 
for  the  unnormalized  crosscorrelation  matrix  H.) 

Proof  of  Lemma  1:  Let  S  =  R'R  -  /.  By  the  definition 
of  generalized  inverse,  it  follows  that  RS  =  0,  i.e.,  every 
column  of  S  is  in  the  null  space  of  R.  However,  if  the  k  th 
user  is  linearly  independent,  it  is  necessary  that  the  &th 
element  of  each  such  column  be  zero.  Hence  (RlR  -  I)kl 
=  0  for  all  ;  =  1, •••,£. 

Similarly,  with  S  =  RRr  - 1  and  SR~0,  we  obtain 
(RRf)jk  =  Sjk.  Equivalently,  RR'uk  =  uk,  using  the  Ac  th 
unit  vector  uk.  Hence,  for  any  generalized  inverses 
R j,  R'2,  R(R{  -  R2)uk  =  0.  However,  since  the  &th  user  is 
linearly  independent,  it  is  necessary  that  the  k\h  element 
of  each  vector  in  the  null  space  of  R  be  zero.  Hence 
(f?[-  R2)kk  =  0. 

Now  we  continue  with  the  proof  of  Proposition  1.  Parti¬ 
tioning  R+  with  respect  to  the  Arth  row  and- column,  we 
have,  say,  R+  =  [C,c,y\.  Now,  computing  the  submatrices 
of  the  partitioned  matrix  R+R  and  using  Lemma  1,  it 


follows  that 


Rkc  +  yak  =  0  (2.17) 

and 

cTaA  +  Y  =  l.  (2.18) 


Notice  that  v  #  0,  for  otherwise  c  would  belong  to  the  null 
space  of  Rk  and  would  not  be  orthogonal  to  ak ,  which,  as 
we  saw,  is  not  possible.  Finally,  substituting  (2.17)  into 
(2.16)  we  obtain 


1  T  * 

1  ^2  C  R kc 

1  T 

1 - 2cTRkc 

Y2 


=  1 


+  —cTak 
Y 


1  1 
Y  Rtk 


(2.19) 


where  the  second,  third,  and  fourth  equations  follow  from 
the  definition  of  generalized  inverse,  (2.17)  and  (2.18), 
respectively. 


III.  The  Decorrelating  Detector 

In  the  absence  of  noise,  the  matched  filter  output  vector 
is  y  —  Hx.  Thus  if  the  signal  set  is  linearly  independent 
(i.e.,  H  invertible),  the  natural  strategy  to  follow  in  this 
hypothetical  situation  is  to  premultiply  >>  by  the  inverse 
crosscorrelation  matrix  H~l.  The  detector  x  =  sgn //“'>> 
was  analyzed  in  [8],  where  its  performance  was  quantified 
in  the  presence  of  noise.  In  [6]  it  was  erroneously  shown 
(cf.  [3])  that  this  detector  is  optimum  in 'terms  of  bit-error 
rate.  Note  that  the  noise  components  in  H~  xy  are  corre¬ 
lated,  and  therefore  sgn  H~  ly  does  not  result  in  optimum 
decisions.  It  is  interesting  to  point  out  that  this  detector 
does  not  require  knowledge  of  the  energies  of  any  of  the 
active  users.  To  see  this,  let  yk  -  yk/{wk,  i.e.,  yk  is  the 
result  of  correlating  the  received  process  with  the  normal¬ 
ized  (unit-energy)  signal  of  the  k  th  user.  Then 

sgn  H~  ly  =  sgn  W~  i/2R  ~ 1  W~  l/2y 
=  sgn  W~l/2R~ly 
=  sgn  R~ly, 

and  therefore,  the  same  decisions  are  obtained  by  multiply¬ 
ing  the  vector  of  normalized  matched  filter  outputs  by  the 
inverse  of  the  normalized  crosscorrelation  matrix.  Apart 
from  the  attractive  asymptotic  efficiency  properties  shown 
below  for  the  decorrelating  detector,  further  justification 
for  its  study  is  provided  by  the  fact  that  it  is  the  solution  to 
the  generalized  likelihood  ratio  test  or  maximum  likelihood 
detector  (e.g.,  [12,  ch.  2],  [13,  p.  291])  when  the  energies  are 
not  known  by  the  receiver.  This  approach  selects  the 
decisions  that  maximize  the  maximum  of  the  likelihood 
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function  over  the  unknown  parameters,  i.e.,  (cf.  (2.5)) 


bs  e  arg  min 

cT 

K 

2 

min  / 

'■(')-  £  bksk(t) 

dt 

he  {-1.1}* 

wt  >  0  -o 

k-l 

i-i 

=  arg  min 

min  yTH~ly  +  bTHb  -  2  bTy 

he  {-l.l}* 

wt  >  0 

i-l.  -.K 

=  arg  min 

min 

he  ( —  t.i}* 

w,  >  0 
i-t.  -. K 

•yTR~lp  +  bT \Vl/1RWx/1b 

-l(Wl/2b)Ty 

=  sgn(arg  min 
'  is  rk 

xTRx-2xTy 

|  =  sgn/?"l.p. 

Since  in  this  paper  the  signal  set  is  not  constrained  to  be 
linearly  independent,  the  above  detector  need  not  exist.  In 
general,  we  consider  the  set  1(H)  of  generalized  inverses2 
of  the  crosscorrelation  matrix  H  and  analyze  the  proper¬ 
ties  of  the  detector 


i  =  sgn  H'y,  (3.1) 


which  we  refer  to  as  a  decorrelating  detector. 

The  £th  user  asymptotic  efficiency  achieved  by  a  gen¬ 
eral  linear  transformation  T  can  be  obtained  in  a  way 
similar  to  that  of  the  efficiency  of  the  conventional  single- 
user  detector  T  - 1  (Section  II).  The  first  step  is  to  find 
the  bit  error  probability  of  the  A:th  user: 

^  =  />[^=i|^  =  -i]  =  />[(r//x+r«)fc>0|^  =  -i] 


=  p 


( 7n ) *  >  ( TH )kk~  £  (TH)kjXj 

j*k 


*€( 
•**  ' 


■1,1}* 
■  -1 


(Tn)k>(TH) 


kk 


I  (TH)kjXj  ■ 
J  +  k 

(3.2) 


Since  the  random  variable  (Tn)k  is  Gaussian  with  zero 
mean  and  variance  e.qual  to  (THTr)kko2,  the  sum  in  (3.2) 
is  dominated  as  a  -*  0  by  the  term 

2I-*e|((Ttf)**-  Z\(TH)kj\jlo^(THTT)k^.  (3.3) 


Hence,  according  to  definition  (1.1),  the  £th  user  asymp¬ 
totic  efficiency  achieved  by  tiie  linear  mapping  T  is 


Vk(T)  =  max2{0, 


(TH)kk-  Z\ (TH)kj\y 
j+k 


j"k  \j(THTr) 


(3.4) 


kk 


Thus  the  £th  user  asymptotic  efficiency  of  a  decorrelating 
detector  with  matrix  H‘  is  given  by 

(HlH)ku-  £{ [H'H)kj\\ 
j  +  k  _ l 

J(H'HH>T)kk  |‘ 

(3  5) 


i\k(Hl)  =  max2(0, 


f>k 


Proposition  2:  If  user  k  is  linearly  independent  every 
H1  e  1(H)  satisfies 

rik(H')=l/R+kk.  (3.6) 

Thus  for  independent  users  the  asymptotic  efficiency  of 
the  decorrelating  detector  is  independent  of  the  energy  of 
other  users  and  of  the  specific  generalized  inverse  selected. 


Proof:  If  user  k  is  linearly  independent,  we  estab¬ 
lished,  in  Lemma  1  that  (HlH)k,  =  Sr,.  Hence  it  follows 
from  (3.5)  that  '  ' 


Vk(H') 


1 

WkHkk 


(3.7) 


Using  the  defining  properties  of  generalized  inverses 
(see  footnote  2)  it  is  easy  to  check  that  if  Ael(R), 
then  W~l/2AW~1/2e  1(H),  and  if  B  e  1(H),  then 
W{/1BWl/2  e  I(R).  Hence  there  is  an  obvious  bijection 
between  I(R)  and  1(H).  Note  that  H+  need  not  be  the 
image  of  R+  in  this  bijection.  However,  the  inverse  image 
of  H+,  say  R*  €  I(R),  satisfies 

=  wk(n '-l/2R*w~'/2)kk  =  Rtk •  (3-8) 

Moreover,  since  user  k  is  linearly  independent,  Lemma  1 
implies  that  the  denominator  of  (3.7)  is  equal  to  the  left 
side  of  (3.8)  and  that  the  right  side  of  (3.8)  is  equal  to  Rfk. 
Proposition  2  follows. 


In  Section  IV  it  is  shown  that  if  user  k  is  linearly 
dependent,  then 

vt=  sup  vk(Hr)=  sup  T//t(7’)=Tj,i, 

It' e  1(H)  TeRKx* 

i.e.,  the  best  decorrelating  detector  and  the  best  linear 
detector  achieve  the  same  &th  user  asymptotic  efficiency. 

Proposition  3:  The  near-far  resistance  of  the  decorrelat¬ 
ing  detector  equals  that  of  the  optimum  multiuser  detector, 
i.e.,  for  all  H’el(H), 

inf  Vk(Hr)=  inf  vksVk-  (3.9) 

wy  ^  0  Y)j  a  0 

j+k  j* k 

Proof:  If  user  k  is  linearly  independent,  then  accord¬ 
ing  to  Proposition  1  the  near-far  resistance  of  the  opti¬ 
mum  detector  is  equal  to  the  asymptotic  efficiency  of  the 
decorrelating  detector  (Proposition  2),  which  is  indepen¬ 
dent  of  the  energy  of  the  other  users.  If  user  k  is  linearly 
dependent,  Proposition  1  states  that  the  near-far  resis¬ 
tance  of  the  optimum  detector  is  zero,  and  hence  the  same 
is  true  for  any  detector. 


The  result  of  Proposition  3  is  of  special  importance  in  a 
near-far  environment,  where  the  received  signals  have 
different  energies  and  where  the  energy  ratios  may  vary 
continuously  over  a  broad  scale  if  the  positions  of  the 
users  evolve  dynamically.  In  this  environment  any  decorre¬ 
lating  detector,  with  its  linear  time-complexity  per  bit, 
offers  the  same  near-far  resistance  as  the  optimum  mul¬ 
tiuser  detector,  whose  time-complexity  per  bit  is  exponen¬ 
tial. 
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For  the  case  where  the  signal  set  is  independent,  i.e.,  H 
is  nonsingular  (and  t^  =  ij k(H~l)  is  energy-independent 
for  all  users),  a  geometric  explanation  for  the  equality  of 
y\k  and  -qf  can  be  given  in  the  two-user  case.  Recall  that 
the  received  signal  y  satisfies:  y  =  Hx  +  it  and  the  noise 
autocovariance  matrix  is  H.  To  have  spherically  symmetric 
noise,  it  is  convenient  to  work  in  the  H~x/2y  domain.  Here 
the  hypotheses,  denoted  by  A,B,C,D  in  Fig.  1,  are  at  the 
points  Hl/2x,  with  x  e  {-1,1}2.  Since  in  this  domain  the 
matched  filter  output  noise  is  spherically  symmetric  and 
Gaussian,  the  decision  regions  of  the  maximum  likelihood 
detector,  determined  by  the  minimum  Euclidean  distance 
rule,  are  given  by  the  perpendicular  bisectors  of  the  seg¬ 
ments  between  the  different  hypotheses,  and  the  k  th  user 
asymptotic  efficiency  corresponds  to  the  square  of  half  the 
minimum  distance  between  distinct  hypotheses  differing  in 
the  &th  bit. 


Fig.  1.  Hypotheses  and  decision  regions  in  two-user  case. 


The  decision  regions  of  the  decorreiating  detector  are 
cones  with  a  vertex  at  the  origin,  such  that  application  of 
H~l  maps  them  to  the  coordinate  axes.  Thus  in  the 
H~l/2y- domain  the  decision  cones  pass  through  the  points 
Hl/2e,  with  e  the  unit  vectors  in  R2.  These  points  are  at 
the  center  of  the  sides  of  the  parallelogram  formed  by  the 
hypotheses,  because  the  unit  vectors  can  be  represented  as 
half  the  sum  of  adjacent  hypotheses.  So,  the  decorreiating 
detector  decision  boundaries  are  parallel  to  the  parallelo¬ 
gram  sides  and  intersect  it  at  the  centers  of  its  sides.  The 
£th  bit-error  probability  (by  symmetry  we  can  assume  that 
the  transmitted  bit  was  - 1)  is  the  sum  of  two  integrals, 
one  for  each  possibility  for  the  remaining  bit,  of  the  noise 
density  function  over  the  region  in  which  the  klh  bit  is 
decoded  as  1.  In  this  case  the  k  th  btt-error  probability  can 
be  easily  computed  by  taking  advantage  of  the  aforemen¬ 
tioned  properties.  To  this  end  we  rotate1  the  coordinate 
system  to  let  the  y  axis  coincide  with  the  /cth-bit  decision 
boundary  and  use  the  equal  distance  property  of  the 
decision  boundary  to  the  hypotheses,  to  observe  that  the 
two  integrals  are  equal.  We  then  use  the  spherical  symme¬ 
try  of  the  noise  to  identify  each  integral  as  a  ^-function  of 
the  distance  of  the  hypothesis  to  the  decision  boundary. 
Hence  the  A:  th  user  asymptotic  efficiency  of  the  decorreiat¬ 
ing  detector  is  equal  to  the  square  of  the  distance  of  any 


hypothesis  to  the  Arth  bit  decision  boundary.  Thus,  in  Fig. 
1,  fiH  is  the  length  of  the  shortest  of  the  segments  AM, 

AO  and  BO,  and  fitf  is  the  length  of  AP.  The  result  of 
Proposition  3  can  now  be  interpreted  as  follows.  Since  rj 
appears  as  the  hypotenuse  and  rf  as  the  leg  of  a  right- 
angled  triangle,  tj  is  lower-bounded  by  the  energy  indepen¬ 
dent  if.  However,  since  the  triangle  angles  vary  with 
increasing  energy  of  the  interfering  user  waveform,  there  is 
a  particular  energy  ratio  for  which  the  triangle  degenerates 
into  a  line  segment.  This  is  the  point  when  i)  reaches  its 
minimum  ij,  which  is  geometrically  identical  with  r\d.  For 
the  parallelogram  formed  by  the  hypotheses,  this  is  the 
case  where  a  diagonal  is  perpendicular  to  a  side  (e.g.,  AO 
perpendicular  to  CD). 


IV.  The  Optimum  Linear  Multiuser  Detector 


We  now  turn  to  the  question  of  finding  the  optimum 
linear  detector.  We  have  seen  that  this  is  a  fruitful  ap¬ 
proach,  since  a  particular  type  of  linear  detector,  the 
decorreiating  detector,  offered  a  substantial  improvement 
in  asymptotic  efficiency  compared  to  the  single-user  detec¬ 
tor,  while  its  near-far  resistance  equaled  that  of  the  opti¬ 
mum  multiuser  detector.  While  we  now  know  that  no 
detector,  linear  or  nonlinear,  can  outperform  the  decorre¬ 
iating  detector  with  respect  to  near-far  resistance,  for 
fixed  energies  it  is  indeed  possible  to  obtain  linear  detec¬ 
tors  that  have  a  higher  asymptotic  efficiency  than  the  one 
achieved  by  the  decorreiating  detector. 

We  find  the  linear  detector  which  maximizes  the  asymp¬ 
totic  efficiency  (or  equivalently  minimizes  the  probability 
of  bit  error  in  the  low-noise  region)  and  compare  the 
achieved  asymptotic  efficiency  to  the  ones  achieved  by  the 
conventional  and  optimal  detectors.  Thus  we  ask  which 
mapping  T:  RK  -*  RK  maximizes  the  asymptotic  efficiency 
of  the  decision  scheme 


i  =  sgn(7»  =sgn(77/jr  +  Tn).  (4.1) 

The  interpretation  of  this  optimization  problem  in  terms 
of  decision  regions  is  to  find  the  optimal  partition  of  the 
^-dimension  hypotheses  space  into  K  decision  cones  with 
vertices  at  the  origin.  The  surfaces  of  these  cones  deter¬ 
mine  the  columns  of  the  inverse  T~x  of  the  mapping 
sought.  Application  of  T  on  the  cone  configuration  will 
map  the  cones  on  quadrants,  after  which  a  sign  detector  is 
used. 

The  A  th  user  asymptotic  efficiency  of  a  general  linear 
detertor,  as  given  by  (4.1)  was  derived  in  (3.4): 


T}*(r)  =  max2<0, 


(TH)kk-  Ei(ra%l\ 

j*k  1 


fiTHT7) 


kk 


I' 


(4.2) 


The  best  linear  detector  has  the  asymptotic  efficiency 

V*  =  sup  vk(T).  (4.3) 

T  e  rKxK 

Hence  the  asymptotic  efficiency  of  the  best  linear  detector 
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is  equal  to 


t]‘k  =  sup  max2 

oS  rk 


hlv-  E  \hjvh 

i*k 
'J  vtHv 


where  v  denotes  the  Ath  row  of  T.  To  minimize  the 
probability  of  Pk,  we  have  to  maximize  the  argument  of 
the  Q-function,  and  equivalently  maximize  the  asymptotic 
efficiency  -!)*(«),  with  respect  to  the  components  of  the 
vector  v.  Since  the  map  applied  on  the  matched  filter 
outputs  is  linear,  the  asymptotic  efficiencies  of  all  the  users 
can  be  simultaneously  maximized,  each  such  maximization 
yielding  the  corresponding  row  of  the  map  to  be  applied. 
For  the  sake  of  clarity,  we  first  consider  the  two-user  case, 
for  which  explicit  expressions  for  the  maximum  linear 
asymptotic  efficiency  can  be  obtained. 


A.  The  Two-User  Case 

Throughout  this  subsection  we  denote  the  normalized 
crosscorrelation  between  the  signals  by  p  =  R  n.  We  first 
give  an  explicit  expression  for  the  optimum  linear  detector. 

Proposition  4:  The  k  th  user  optimal  linear  transforma¬ 
tion  Tk{  =  vTy  on  the  matched  filter  outputs  prior  to 
threshold  detection  is  given  by 


=  [l;  -sgnpmin 

(i.IpIK/w’,)172}] 

(4.6) 

_  j  [1;  -sgnpj, 

(wi/wic)1/2  ^  Ip! 

(4.7) 

w. 

otherwise 

where  bTk  is  the  Ath  row  of  the  decorrelating  detector  and 
(i,k)G{(  1,2), (2,1)}. 

Proof.  Without  loss  of  generality,  let  A  =1.  We  have 


H  = 


vM  = 


w. 


w. 


or=  [1;  £72] 


(4.8) 


Pf> l 

P\jwxw2  w2 
1  h\v-\hT2v\ 

'/vTHv 

1  +  P ( w2/ w, y 1/2 v2  -  |p ( w2/w\ )xn  +  ( w2/wx )  u2| 


\Jl  +  2p(wz/wl)1/2v2  +  (w2/wl)vl 


(4.9) 


and  the  objective  is  to  maximize  the  nght  side  of  (4.9)  with 
respect  to  v2.  We  consider  the  case  |p|  =1  separately. 


a)  Case  |p|  *  1:  Introduce  an  indicator  function  for  the 
absolute  value  term: 

f  I  P  +  (w’2/tvl)t/I‘,2>0 

H-i,  p+K/'-.y'V o-  (4-,0) 

lo,  else 

Then 

dVi _ (l-p2)(w2/w’i) _ 

du2  (l  +  2p{w2/wl)l/2v2  +  (w1/w[)v12f/2 

(4.11) 

Therefore,  we  should  take  v2  =  -  /  when  this  is  consistent 
with  the  definition  of  I  as  a  function  of  u2.  Thus 

f  1,  if  /=  -1 »  0  <  {w2/wiY/2  <  “  P 

\  —  1,  if  7  =  1  «=>  0  <  (w2/w,)1/2  <p 

(4.12) 

As  can  easily  be  seen,  both  values  correspond  to  maxima. 
If  neither  of  these  conditions  is  met,  the  derivative  does 
not  have  a  zero.  The  optimal  value  for  v2  can  be  deter¬ 
mined  by  taking  a  closer  look  at  the  behavior  of  dt)x/dv2 , 
in  Fig.  2. 


For  both  I -  l  and  I- -l,  the  derivative  of  ij!  is 
positive  for  v2  smaller  than  the  abscissa  of  the  zero  of  the 
derivative  (which  is  equal  to  - 1),  and  negative  after¬ 
wards.  Due  to  the  nonlinearity  of  i]l  the  derivative  has  the 
form  corresponding  to  /  =  -1  for  u2<  -  p(w2/w1)l/ 2  and 
the  form  corresponding  to  /  =  1  afterwards.  Since  the 
second  branch  (for  7  =  1)  turns  negative  before  the  first 
one,  we  have  to  take  the  largest  value  of  v2  yielding  a 
positive  derivative  on  the  first  branch.  It  can  easily  be  seen 
that  in  the  “no-zero”  case,  -1  <  -  p(w2/h'1)1/2  <1,  this  is 
the  point  of  discontinuity,  i.e.,  o2  =  -  p(w2/w1)1/2.  Note 
that  for  p  =  0  we  get  vT  -  [1;  0],  the  identity  transforma¬ 
tion,  as  expected,  since  the  users  are  then  decoupled  and  a 
single-user  detector  is  optimal.  By  taking  the  inverse  of  R 
we  also  see  that  in  the  no-zero  case  the  optimal  transfor¬ 
mation  vector  is  exactly  the  corresponding  row  of  the 
inverse  correlation  matrix. 
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b)  Case  |p|  =  I:  Equation  (4.9)  becomes 

IxM  =  sgn(l  +  sgnp(H'2/w1)1/2t>2)-(w2/w1)I/2. 

(4.13) 

We  see  that,  for  (w2/wl)l/2  <1,  any  u2  satisfying  u2  sgn  p 
>  -(wl/w2)l/2  is  optimal,  in  particular  the  one  given  in 
(4.7).  Otherwise,  the  asymptotic  efficiency  of  the  best 
linear  transformation  is  zero,  hence  all  linear  transforma¬ 
tions  are  equivalent. 

Substituting  the  result  of  Proposition  4  into  the  asymp¬ 
totic  efficiency  of  (4.9),  we  obtain  the  following 
Proposition  5:  The  k  th  user  asymptotic  efficiency  of  the 
optimal  linear  two-user  detector  equals 

^  =  h-2\P\(w,/wk)l/2+  w,/wk,  if(w,/H>J1/2<;|pl 
(l-p2,  .  otherwise 

(4.14) 


the  second  term  outweighs  the  second  part  of  the  first 
term,  so  the  best  one  can  do  is  to  eliminate  it  by  choosing 
v2  =  -  p(w{/w2)w2  (the  decorrelating  detector).  Since  this 
minimizes  the  noise  variance  at  the  same  time,  it  is  the  best 
strategy  in  this  region.  If,  however,  .(w2/w!)l/2  <  |p|,  and 
u2  is  such  that  the  term  p(p  +  w2(*v2/wi),/il  is  positive,  it 
is  a  better  policy  to  allow  interference  from  user  2,  which 
is  compensated  by  the  second  part  in  the  first  term,  and 
use  the  residual  positive  contribution  in  the  first  term  to 
increase  the  SNR  as  compared  to  the  decorrelating  case. 
We  have  seen  that  this  strategy  leads  to  the  same  perfor¬ 
mance  as  the  more  complex  maximum  likelihood  detector. 

Note  that  in  the  two-user  case  the  signal  energies  and 
cross  correlations  cannot  be  picked  so  as  to  allow  both 
users  optimal  performance  at  the  same  time,  for  user  1  we 
need  {w2/wx)x/2  <  |p|  <1,  whereas  for  user  2  we  need 
(w2/wx)l/2  >  l/|p|  >  1. 

B.  The  K-User  Case 


for  (i,  k)  e  {(1,2), (2,1)}. 

The  Ath  user  asymptotic  efficiency  obtained  in  the 
range  (w,/wk)1  n  <  |p[  equals  the  optimum  asymptotic  effi¬ 
ciency,  obtained  in  (2.12).  Even  outside  the  region  of 
optimality,  the  best  linear  detector  shows  a  far  better 
performance  than  the  conventional  single-user  detector 
(see  Fig.  3),  since  if  wi/wk  >  p2,  then  i\\  is  independent  of 
w,/wk,  whereas  according  to  (2.7)  the  asymptotic  effi¬ 
ciency  of  the  conventional  detector  is  equal  to  zero  for 
w,/wk  >  1/p2. 


ASYMPTOTIC  '  . OPTIMUM  MULTIUSER  OETECTOR 

EUSeR°!CY  - OECORRELATING  OETECTOR 

- CONVENTIONAL  SINGLE-USER  OETECTOR 


Unlike  Propositions  2  and  5,  in  the  general  /f-user  case 
it  is  not  feasible  to  obtain  an  explicit  expression  for  the 
asymptotic  efficiency  achieved  by  the  best  linear  detector 
Proposition  6.  The  A;  th  user  asymptotic  efficiency  of  the 
best  linear  detector  equals: 


i)'k  =  —  max2(0,  max  rj(e) 

wk  \  *,€{-1.1} 

j*k 


T](e)  =  max  vjHv 
rk 
JHo-  i 

tjkjv  £  0 
j  +  k 


(4.16a) 


(4.16b) 


where  the  /  th  component  of  v0  is  equal  to 

/  \  i~ev 


Fig.  3.  Asymptotic  efficiencies  in  two-user  case  (p  -0.6). 

There  is  an  intuitive  interpretation  of  the  dual  behavior 
of  the  best  linear  detector  and  of  the  boundary  point 
(wi/wk)l/1  ~  IpI-  Let  k  =  \.  The  input  to  the  threshold 
device  corresponding  to  the  first  user,  zx  =  vTy,  has  three 
components: 

h  =  wi  [(1  ~  P2)  +  p(p  +  vi(w2/wi)l/2)\  *i 

+  wx[(w2/wi)l/2(p  +  v2{w2/Wl)l/2)\x2  +  h  (4.15) 

where  h  is  a  Gaussian  random  variable  of  variance 
w1o2[(l  -  p2)  +  (p  +  o2(w2/h'1)1/2)2].  For  (w2/wi)1/2  >  |p|, 


Then  the  maximum  i)(e)  is  achieved  for  6  such  that 

+  E  Va 

- _ J*k _ 

V  (  \ 1/2  ’ 
[vjHVo  +  vlH  Z  VA 

\  L.  / 


(“y)i" 


o,  i  *  j 

1.  i  =  j 

(4.17) 

>  0  for  j*k 

(4.18) 

hTjV  #  0  =»  \j  =  0 

(4.19) 

Xy^O,  j¥=k. 

(4.20) 

Proof:  Let 


Sf  =  { x<=RK:hJx>0 } 
Sj~  =  {xsRK:h^x^0}. 


(4.21) 
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From  (4.5)  we  seek 


sup  —=== 
vsRK  V  vtHv 


E  l^J^I 

J  +  k 


=  max  sup  j==,-\h[ V-  £  !/»>)  (4.22) 

f e  ( —  1  •  1 )  v er>s;>  yvTHv\  ,  +  k  ' 

'**  s 

=  max  y{e),  with  -n(e) 

0  e  {  - 1  •  1 } 

J  +  k 

■  SUP  "(**”-  (4-23) 

wens;/  y  v‘ Hv  \  ,  I 

J 

From  *he  definition  of  q,  we  see  that  the  term  m  parenthe¬ 
ses  equals  VqHv.  Now  d€  CijSf  «=»  ^/rju>  0,  and 

since  tj*  is  invariant  to  scaling  of  v,  maximization  of  the 
given  functional  over  RK  is  equivalent  to  maximization 
over  the  ellipsoid  vTHv  =  1. 

This  proves  the  first  part  of  Proposition  6.  We  now  have 
to  perform  two  maximizations  where  the  second  one  has 
the  explicit  form  of  an  exhaustive  search.  We  turn  our 
attention  to  the  inner  maximization  in  (4.16).  We  first 
show  that  it  is  possible  to  replace  the  feasible  set  therein 
by  an  equivalent  convex  set,  i.e.,  the  asymptotic  efficiency 
is  unchanged  if  we  replace 


Since  this  is  a  mi  .imization  problem  of  a  continuous 
convex  function  ~>n  a  compact  convex  set,  it  achieves  a 
unique  minimum  on  the  set.  Since  all  the  functions  are 
differentiable,  we  can  apply  the  Kuhn-Tucker  conditions 
(e.g.,  [4]),  to  get  from  condition  (1), 


-  Hv0  +  \02Hv  -  £  A jejhj  -  0, 

j  +  k 


hence 


fi=TT-(n>+  E  Va)  (4-27) 

ZAo'  j  +  k  > 

with  us  the  y'th  unit  vector,  as  defined  before.  Equations 
(4.18)  and  (4.19)  result  fror  the  Kuhn-Tucker  conditions, 
condition  (4.20)  expresses  the  nonnegativity  requirement 
for  the  There  is  one  more  constraint  to  satisfy,  which  is 
vtHv  =  1 : 

1  ,  \  vf Hv 

1  ■  *r*-  wM111*  £V/*I*)  ~ix;- 


We  used  condition  (4.19)  to  get  the  last  equality,  so 


2\0  =  vfHv=*  y{e), 


(4.28) 


sup  vjHvby  ij(e)  =  sup  o*Ho.  (4.24)  ancj  since 


o  e  rk 

or//f-l 
e,hjv  a  0 


oe  Rk 
0*7/0  si 
e,hjv  a  0 

J  +  k 


—ylHv°  + 


Z  ^jfyl 

j*k  ’ 


To  show  (4.24),  let  y-  Hl/2 v,  zj=  yth  row  of  Hl/2.  It 
then  follows  that  hTsv  =  zjy,  vjHl/2  =  yf,  vTHv  —  yTy  = 
\y\\  and 

y(e)=  sup  yary=  sup  |j^||jt|cosa  (4.25) 


ye  rk 

yeRx 

l>i-l 

|.y|~l 

e,vr>'*° 

t/ijy  a.  o 

J  +  k 

j  +  k 

where  a  is  the  angle  between  the  vectors  y0  and  y.  Since 
the  inequality  constraints  are  linear  and  partition  the  space 
into  convex  cones  with  vertex  at  the  origin,  the  optimal 
angle  a  is  independent  of  |  y\.  Either  the  optimal  cos  a  is 
nonnegative,  in  which  case  y(e)  is  maximized  for  |.y| 
maximal  in  both  versions,  or  it  is  negative,  in  which  case 
i?(e)  <0.  In  either  case,  the  value  of  ifk,  which  involves 
comparison  with  zero,  is  unchanged  if  the  maximization  is 
performed  over  the  interior  of  the  ellipsoid,  which  com¬ 
pletes  the  proof  of  the  claim. 

We  now  have  to  consider  the  following  problem: 

y(e)-  inf  -vfHv.  (4.26) 

oS  rk 
</7/o-iso 

-  CjhJo  s  0 
j  +  k 


we  get 

/  \!/2 
2  +  • 

'  j  +  k  > 

This  together  with  (4.27)  completes  the  proof  of  Proposi¬ 
tion  6. 

In  Appendix  II  we  show  an  explicit  procedure  for  find¬ 
ing  the  best  linear  detector  characterized  in  Proposition  6. 
Its  asymptotic  efficiency  is  trivially  upper-  and  lower- 
bounded  by  that  of  the  optimum  and  decorrelating  de¬ 
tectors,  respectively.  For  certain  values  of  energies  and 
crosscorrelations  these  bounds  are  attained;  sufficient  con¬ 
ditions  for  this  to  occur  are  given  in  Propositions  7  and  8. 

Proposition  7/  The  following  are  sufficient  conditions  on 
the  signal  energies  and  crosscorrelations  for  the  best  linear 
detector  to  achieve  optimal  &th  user  asymptotic  efficiency: 


max 

-l.—  .K 


(4.29) 


Proof:  In  the  optimality  case,  we  show  in  Appendix  II 
that  eshT]V0  >  0  for  all  j  *  k.  If  we  introduce  ek  =  1  this  has 
to  hold  also  for  j  =  k,  otherwise  we  get  negative  asymp- 
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totic  efficiency.  We  can  rewrite  these  conditions  as 
DHD(- 1-1  •••  1 - l)r  = 


Hn  ele2^l2 

ele2^2\  H22 


e\eK^Kl  eKe2^K2 


e.tf.A 

e2^2k 

eK^Kk 


e^KHXK 

e2  eK^2K 

Hkk 


(4.30) 


where  D  is  the  diagonal  matrix  with  /  th  diagonal  element 
equal  to  e,.  We  now  see  that  a  sufficient  condition  for  the 
above  inequality  to  hold  for  some  ex,---,eK  is 

Wjk\>  j  =  \,‘”.K. 

i  *  k 

The  corresponding  <?y  are  eJ  =  sgn  Hjk.  Hence  (4.29)  fol¬ 
lows  by  replacing  HIJ  by  Jw,Wj  Rir 


i.e.,  for  a  dependent  user  the  best  decorrelating  detector 
has  the  same  asymptotic  efficiency  as  the  best  linear  detec¬ 
tor. 

Proof:  Recall  the  bijection  between  /(/?)  and  1(H) 
established  in  Section  III.  Let  R'  denote  the  image  of  H1 
under  this  bijection.  Then,  using  (3.5)  for  the  last  equality, 
we  can  write 


Note  that  the  above  condition  can  be  satisfied  by  only 
one  user  because 

fk>Fj/\Rkj\>Fi,  for  all  j. 

Proposition  8:  If  user  k  is  linearly  independent,  the 
following  condition  is  sufficient  for  the  kth  row  of  the 
decorrelating  detector  H1  e  1(H)  to  be  the  best  A:  th  user 
linear  detector  for  a  given  set  of  signal  energies  and 
crosscorrelations: 

I Hjk\zHlkt  for  all  j*k.  (4.31) 

Proof:  We  showed  that  in  the  terminal  case  v  = 
h[/\)H‘kk  is  a  maximizing  vector  for  vfHv,  hence  there  are 
nonnegative  Kuhn-Tucker  multipliers  \k,  such  that,  with 
(4.27)  and  (4.28), 


or 

=[(*!- IK, -..,1,  •••,(A*-lK]r 

Hkk 

SO 

=  1  +  ejHjk/Hkk ,  j  +  k.  (4.32) 

Hence  (4.31)  is  sufficient  to  ensure  \j  >  0  regardless  of 
(eit  i*k). 

Note  that  in  the  two-user  case,  Proposition  5  implies 
that  the  sufficient  conditions  found  in  Propositions  7  and 
8  are  also  necessary. 

Proposition  9:  If  user  k  is  linearly  dependent,  then 
% ?*=  sup  t ?Jt(H/)=  sup  7?*(r)=i)'*  (4.33) 

H'el(H)  TeRKXK 


Vdk=  sup  v  *(/?')»  sup  1 ]k(W-^R'W-^) 

//'e  /(//)  R'el(R) 


=  max2  (0,  sup 

R1  e  HR) 


\ fw,  . 

(R'R)kk-  L\(R'R)kj\j=) 

vH  \ 


i*k 


\I{r'rr,t) 


kk 


(4.34) 


Since  R  is  nonnegative  definite  of  rank  r,  it  can  be  written 
using  its  orthonormal  eigenvector  matrix  T  and  the  r  x  r 
diagonal  matrix  A  of  nonzero  eigenvalues  of  R,  as 


R 


(4.35) 


Then  (cf.  [10]),  Rl  is  a  generalized  inverse  of  R  if  and  only 
if,  for  some  matrices  U  and  V  of  appropriate  dimensions,  it 
can  be  written  as 


/?'  = 


V 

UAV 


(4.36) 


Hence,  using  the  corresponding  partition  of  T,  we  can 
write 


{R'R)kj  =  uTk[TxT2] 

1 - 1 

o  o 

uj 

=  ul{Tj{+TfJAT?)U] 

(4.37) 

(R,RR'T)kk  =  ul[TlT1\ 

. 1 

Si 

i _ i 

V 

Jl. 

uk 

=  «r(7’iA"1: 

Tf  +  T-JUTf 

+  TXUT?  +  T2UAUTT2r)uk  (4.38) 


7}*  =  max2(0,  sup 


fij 


ulfa  +  TJJAWu,-  Z  \ul(Ti  +  T2UA)TiTUj\j=r) 

_ jjjk _ ( 


t/6  JuTk{Tx  +  TJJA)A-\TX  +  TjJA)Tuk 


(4.39) 
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Since  user  k  is  dependent,  «[r2,  whose  components  are 
the  k  th  components  of  the  eigenvectors  to  eigenvalue  zero, 
is  nonzero.  (Otherwise,  for  all  x  with  Rx  =  0,  xk  would  be 
zero,  which  implies  that  the  k  th  user  is  linearly  indepen¬ 
dent  of  the  other  users.)  Hence  since  A  is  invertible,  we 
can  make  the  change  of  variables 


to  get 


tfk  =  max2(0,  sup 


x  =  (Tl  +  T2UAfuk 

xTr?»k  -  £  \*TTr«M 

j  +  k  iWk 


(4.40) 


/ r ^ 


)■  (4.41) 


Using  the  same  reasoning  as  in  the  proof  of  Proposition  6 
for  the  best  linear  detector,  we  can  write 


7}^  =  max2(0,  max 


1 


sup 


e;6(-l.l}  ISR,  W, 

J*k  x‘K~'x  - 1 
ejxrT^uj  a  0 
j  +  k 


—  vt0Wx/2Txx 


(4.42) 


where  the  i  th  component  of  v0  is  equal  to 


/  #  k 
i  =  k' 


with 


i]i  =  —  max2(o,  max  i|j[(«))  (4.43a) 

wk  \  ^e(-U)  j 
j+k 

■qf{e)  =  '  sup  Wx/2Txx  (4.43b) 

iSR' 


ejx^z  0 
j+k 


We  show  that  x*  =  A7’1rW/l/2o*  is  feasible,  and  that 
vr0Wx^Txx*  =  r(k{e): 

ejX*TT[uj  -  ejv*TWx/2TxATxuJ 

=  ejv*TH'x/2RWx/2lV-'/2uJ 

=  —ejv*THu 

1 

=  ^=eiv*Thl>0  (4.45) 

h 

since  v*  is  feasible, 

xmTA~xx*  =  v*TlVx'r2TlAA~lATflVx/2v* 

=  c*THv*  =  l.  (4.46) 

Hence  x*  is  feasible,  and 

vt  iy\/tTiX*  =  ct  w^2TxATl  Wx/\* 

=  dJ//d*  =  t,'(c).  (4.47) 

We  know  that  i)k  ^  T)’k,  since  the  decorrelating  detector 
belongs  to  the  class  of  linear  detectors.  We  exhibited  for 
each  e  a  feasible  vector  x*,  which  satisfied  vlWl/2Txx*  = 
7> lk(e).  Since  from  (4.43),  t idk(e)  >  vJWx/2Txx  for  all  feasi¬ 
ble  x,  we  have,  for  all  e,  T)dk(e)  £  ij*(<?).  Hence  T)k  >  t]‘k, 
which  establishes  (4.33). 

Since  the  kth  user  asymptotic  efficiency  depends  only 
on  the  kth  row  of  the  applied  linear  transformation, 
optimization  of  ijJt(/f/)  over  the  class  of  generalized  in¬ 
verses  for  each  dependent  user  k,  yields  different  rows, 
each  belonging  to  a  different  generalized  inverse.  Conse¬ 
quently,  the  collection  of  the  K  optimal  rows  need  not  be  a 
generalized  inverse. 

Finally,  notice  that  the  near-far  resistance  of  the  opti¬ 
mum  linear  detector  is  equal  to  that  of  the  optimum 
detector,  since  it  is  shown  in  Proposition  3  that  a  particu¬ 
lar  type  of  linear  detector,  namely,  the  decorrelating  detec¬ 
tor,  achieves  optimum  near-far  resistance. 


whereas  the  A:  th  user  asymptotic  efficiency  of  the  best 
linear  detector  equals  (cf.  (4.16)), 


with 


Let 


J  - 

i)k  - 


max 


Wl 


0,  max  r?l(e) 
',*{-1.1} 
j  +  k 


v[(e)  =  sup  vT0Hv. 

oe  Rk 
ejhju  a  0 

j  +  k 


v*  e  argi]k(e)  -  arg  max  cj Hv.  (4.44) 

t>e  rk 
Jlh  "■  1 

fyAyCaO 

j+k 


V.  Conclusion 

The  main  contribution  of  this  paper  is  the  establishment 
of  the  fact  that  a  set  of  appropriately  chosen  memoryless 
linear  transformations  on  the  outputs  of  a  matched  filter 
bank  exhibits  z  substantially  higher  performance  than  the 
conventional  single-user  detector,  while  maintaining  a 
comparable  ease  of  computation.  Moreover,  the  near-far 
resistance  of  all  proposed  detectors  is  shown  to  equal  that 
of  the  optimum  multiuser  detector. 

Even  though  the  worst-case  complexity  of  the  algorithm 
used  to  find  the  best  linear  detector  is  exponential  in  the 
number  of  users,  in  a  fixed-energy  environment  this  com¬ 
putation  needs  to  be  carried  out  only  once;  hence  the 
real-time  time-complexity  per  bit  is  linear,  in  contrast  to 
the  optimum  multiuser  detector.  Moreover,  a  region  of 
signal  energies  and  crosscorrelations  exists  in  which  the 
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optimal  linear  detector  achieves  optimum  asymptotic  effi¬ 
ciency. 

The  decorrelating  detector  is  easier  to  compute  than  the 
optimum  linear  detector,  and  it  exhibits  either  the  same  or 
quite  similar  performance,  depending  on  the  energies  and 
correlations.  Since  the  decorrelating  detector  does  not  re¬ 
quire  knowledge  of  the  transmitters’  energies  and  it 
achieves  the  highest  possible  degree  of  near-far  resistance, 
it  is  an  attractive  alternative  to  the  optimum  detector  in 
situations  where  the  received  energies  are  not  fixed.  The 
only  requirement  for  the  signal  of  a  user  to  be  detected 
reliably  by  the  decorrelating  detector  regardless  of  the  level 
of  multiple-access  interference,  is  that  it  does  not  belong  to 
the  subspace  spanned  by  the  other  signals— a  mild  con¬ 
straint  that  should  be  compared  to  the  condition  necessary 
for  reliable  detection  by  the  conventional  single-user  detec¬ 
tor,  i.e.,  that  the  signal  is  orthogonal  to  all  the  other 
signals. 

The  most  interesting  generalization  of  the  results  of  this 
paper  is  the  asynchronous  code-division  multiple-access 
channel.4  Due  to  the  fact  that  in  the  asynchronous  case  the 
channel  has  memory,  a  tf-input  AT-output  linear  discrete¬ 
time  filter  will  replace  the  memoryless  linear  transforma¬ 
tion  studied  in  this  paper. 

Appendix  I 

This  appendix  gives  a  summary  of  the  results  in  [18].  We  show 
that  the  problems  of  optimum  multiuser  demodulaUon  and  solv¬ 
ing  for  the  maximum  asymptotic  efficiency  are  nondetermimsUc 
polynomial  time  hard  (NP-bard)  in  the  number  of  users  and 
therefore  do  not  admit  polynomial  time  algorithms  unless  such 
algorithms  are  found  for  a  large  class  of  well-known  combinato¬ 
rial  problems  including  the  traveling  salesman  and  integer  linear 
programming.  According  to  (2.5),  the  selection  of  the  most  likely 
hypothesis  given  the  observations  is  the  following  combinatorial 
optimization  problem. 

MULTIUSER  DETECTION— 

Instance:  Given  K  e  Z*,  yeQK  and  a  nonnegative  defi¬ 
nite  matrix  H  eQKxK; 

Find  {b*  e  {-1,1}*}  that  maximizes  2bTy  -  brHb. 

Proposition  10.  MULTIUSER  DETECTION  is  NP-hard. 

Proof:  The  proof  of  NP-hardness  of  MULTIUSER  DETECTION 
can  be  carried  out  by  direct  transformation  from  the  following 
NP-complete  problem  [15]. 

PARTITION— 

Instance:  Given  /.  eZ+,  { /,  eZ+ ,  i “1,-  •  ■,  L}\ 

Question:  Is  there  a  subset  7  c  {1, •••,£,}  such  that 

e  /A  ”  e  /A? 

Given  /,,• ••,,/,  we  choose  the  following  instance  of 
MULTIUSER  DETECTION: 

K-L 
b/j  "  A  A 
A" 0,  ' 

*Noie  added  in  proof:  This  has  now  been  accomplished  in  the  compan¬ 
ion  paper  [19],  using  a  different  approach. 


With  this  choice,  {/,,•  •  is  a  “yes”  instance  of  partition  if 
and  only  if 

max  2bTy-bTHb  =  0.  (A.l) 

Proposition  10  can  be  generalized  [2],  [18]  to  deal  with  arbi¬ 
trary  finite  alphabets  which  are  not  part  of  the  instance  (and 
hence  are  fixed)  of  multiuser  detection,  i.e.,  the  problem  is 
inherently  difficult  when  the  number  of  users  is  large,  regardless 
of  the  alphabet,  size.  It  is  an  open  problem  whether  multiuser 
detection  remains  NP-hard  when  H  is  restricted  to  be  Toeplitz. 
If  this  is  the  case,  then  it  can  be  shown  [18]  that  the  problem  of 
single-user  maximum  likelihood  detection  for  mtersymbol  inter¬ 
ference  channels  [17]  is  NP-hard  in  the  length  of  the  interference. 

The  usefulness  and  relevance  of  Proposition  10  stem  from  the 
fact  that  when  the  users  are  asynchronous,  the  cross  correlauons 
between  their  signals  are  unknown  a  priori  and  the  worst-ca  e 
computational  complexity  over  all  possible  mutual  offsets  is  the 
complexity  measure  of  interest  since  it  determines  the  maximum 
achievable  data  rate  in  the  absence  of  synchronism  among  the 
users.  Actually,  no  family  of  signature  signals  is  known  to  result 
in  optimum  demodulation  with  polynomial-in-A"  complexity  for 
all  possible  signal  offsets.  Thus  even  if  the  designer  of  the  signal 
constellation  were  to  include  as  a  design  criterion  the  complexity 
of  the  optimum  demodulator  in  addition  to  the  bit-error-rate 
performance  (which  dictates  signals  with  low  crosscorrelations), 
he  would  not  be  able  to  endow  the  signal  set  with  any  structure 
that  would  overcome  the  inherent  intractability  of  the  optimum 
asynchronous  demodulation  problem  for  all  possible  offsets. 

The  performance  analysis  of  the  optimum  receiver  for  arbitrary 
energies  and  crosscorrelations  is  also  inherently  hard.  According 
to  (2.11)  the  maximum  achievable  asymptotic  efficiency  is  ob¬ 
tained  as  the  solution  to  multiuser  asymptotic  efficiency. 

MULTIUSER  ASYMPTOTIC  EFFICIENCY — 

Instance:  Given  K  eZ+,  k  G  {1,-  •  •,  K),  and  a  nonnega¬ 
tive  definite  matrix  II  bQKxK; 

Find:  the  kth  user  maximum  asymptotic  efficiency, 

rik  =» —  min  iTHc. 
w*  <6  (-1,0,1)* 

€k  +  0 

Proposition  11:  multiuser  asymptotic  efficiency  is  NP-hard. 

Proof:  The  proof  is  divided  in  two  steps.  First,  -1/0/1 
knapsack  is  polynomially  transformed  to  multiuser  asymp¬ 
totic  efficiency.  Then,  -1/0/1  knapsack  is  shown  to  be 
NP-complete.  In  analogy  to  the  0/1  knapsack  problem  (e.g., 
[16])  we  define 

-1/0/1  KNAPSACK— 

Instance:  Given  L  e  Z+,  GeZ+  and  a  family  of  not  neces¬ 
sarily  distinct  positive  integers 

{/(62+,/»l,-,L); 

Question :  Are  there  integers  c,  e  (-1,0,1),  /  =•  1 ,  -  ■  ■,  L  such 
that  E/l  i</A  ”  6? 

We  transform  -1/0/1  knapsack  to  multiuser  asymptotic 
efficiency  by  adding  a  user.  Given  {(?,/,,•  •  7t },  denote 
and  construct  the  following  instance:  K=>  L  + 1,  k  =  L 
+ 1,  htj  -  flj,  1  <,  i,  j  £  K. 

The  K  th  user  asymptotic  efficiency  is  equal  to  zero  if  and  only 
if  { (7,  /,,--- ,  /,  )  is  a  “yes”  instance  of  -1/0/1  knapsack.  To 
see  this,  note  that  we  can  fix  ik  -  - 1  in  the  right  side  of  (2.11) 
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without  loss  of  generality.  Then, 

1  (  k-i  r  k  - 1 

riK^-p:  min  {  hKK  +  £  <„  ~2hnK  +  £  < „h„n, 

G  e,  6  (-1.0,1)  \  m-l 

lS/SAT-1 

-F,,.,”?,".,,!0-*?,''"'-)-  (A'2)  5 - 

‘S/SAT-1  /  T  \'/2 

The  proof  that  -1/0/1  knapsack  is  NP-complete  can  be  \  va^vo +  ,e,u' j 

found  in  [18]. 

Comment:  Recall  that  this  procedure  has  to  be  repeated  for  all 
APPENDIX  II  the  different  { }  in  search  of  the  maximal  tj(<?)  value,  until 

either  the  efficiency  ij(e)  reaches  the  upper  bound  given  by  the 
We  give  here  an  explicit  procedure  for  finding  the  maximizing  optimal  detector,  or  all  2K  possibilities  have  been  exhausted, 

vector  v  given  implicitly  by  Proposition  6.  The  idea  is  the  Prior  to  running  the  algorithm,  the  sufficient  conditions  given  in 

following:  condition  (4.19)  states  that  if  the  maximizing  vector  6  Propositions  8  and  9  should  be  checked, 

lies  in  the  intersection  of  a  subset  of  the  delimiting  hyperplanes 

with  equations  Ajt3  =  0,  j  e  S,  with  S  the  index  set  of  the  specific  Proof:  Conditions  (4.17)  and  (4.19)  are  obviously  sausfied 
hyperplanes,  only  the  \/t  jeS  are  possibly  nonzero  and  enter  by  construction  of  6  in  C,  and  the  requirement  h] 5  =  0  for  the 


Ajb “0,  is  B,  where 


is  a 


into  the  expression  defining  c.  Thus  we  have  |$|  equations  with 
|S|  unknowns,  which  we  can  solve  to  get  the  X,  and  then  6.  To 
state  (and  prove  the  correctness  of)  an  algorithm  that  finds  the 
optimum  linear  transformation,  the  following  terminology  is  used. 

Definition  1:  Let  S  be  an  index  set  •  •,/,}>  OSes 

K-l,  with  /,,•  •  •,  j„  e  {!,■  •  -,K }-{&},  labeled  in  increasing 


possibly  nonzero  X(  in  B.  To  prove  conditions  (4.18)  and  (4.20), 
consider  the  system  of  |JS|  linear  equations  in  |B|  unknowns  of  B. 
From  A  the  set  B  is  matched,  and  satisfies  CB(j)  -  yes  for  all 
j  4=k,  j  «£  S„.  We  have  to  show  a)  X,  a.0,  for  all  /  =1,2,-  •  •,  K; 
and  b)  CB(j) «  yes  for  aU  j  +  k,  j  <£  S„  is  equivalent  to  condi¬ 
tion  (4.18). 


order.  Define 

a)  X, 

=*  0,  i  £  B,  by  construction  of  the  index  set  S.  and  B.  For 

Hth 

i  e  B,  in  step  B  we  solve  A]d-  0,  all  /  — 1,2, •  •  >,|B|.  Let  |B|  =  n. 
Then 

Ds(J)  ”det 

hfo 

Hhh 

Shu 

.  (A.3) 

*y°o  +  VA  +  ‘  ‘ ® 

hlvo 

•••  3* 

hjiv°  +  +  ■  ■  •  +  =  ® 

Definition  2:  We  introduce 

an 

indicator  for 

the  second 

hl°o + hehHiJi +  •  ’  ’ +  Kej.Hu.  ”  °-  ( A-5) 

Kuhn-Tucker  condition: 

i(eJDs(j)>0,  then  Q(y)  =  yes,  else  Q(y)  =  no.  (A.4) 

Definition  3:  An  n-tuple  S  of  {1, -  •  • ,  K }-{£}  is  matched  if 
for  all  i  eS:  C$_ , (/)  =»  no. 

Definition  4:  An  n-tuple  S  contains  a  basis  B  if  (hj\j  e  5}  is 
a  basis  for  { Ajy  s  S}. 

Proposition  12:  The  following  algorithm  finds  a  vector  v  satis¬ 
fying  (4.17)-(4.2G). 

A.  Search  for  the  index  set  with  least  cardinality  S  c 
{1,-  ••,/(}-  [k),  for  which  i  eS,  are  possibly  nonzero 

n  :=0 

all  n-tuples  ==  untried;  Sa  ■=  matched 
WHILE  n  <,  K-2 

while  there  is  still  an  untried  n-tuple  containing  a  matched 
basis  B 

select  untried  matched  n-tuple  —  S„,  contained  matched 
basis  :=  B 

if  for  all  j<£Sn,j±k,  CB{j) «  yes,  return  S„,  B.stop 
ELSE  S„  ■■=  tried 
RETURN 
n  !=  n  + 1 
RETURN 

“decorrelating  detector  is  optimal,”  output  {2 ,---,K}-{k), 
STOP. 

B.  Computation  of  the  X/ 

/  <£  B:  X/  —  0 

i  <=  B\  X,  are  the  solutions  of  the  \B\  equations  |B|  unknowns 


Denote  by  Dg  the  determinant  of  the  coefficient  matrix  of  the 
Xy;e/(.  Since  B  is  a  basis  and  the  corresponding  matrix  is  nonneg¬ 
ative  definite,  DB  is  strictly  positive.  TTien,  by  Cramer’s  rule, 

(A,) 

UB 

The  numerator  is  obtained  by  i  row  flips  and  /  column  flips  to 
get  jj  into  position  (1,1).  Since  the  set  B  is  matched,  the 
numerator  is  nonnegative.  As  obtained  above,  the  denominator  is 
positive,  hence  X,  s  0  for  all  i  e  B.  This  completes  the  proof  of 
a). 

b)  hjv  -  0,  j  e  S„.  For  /€  S„,  j  =*  k,  with  the  obtained  values 
for  X  compute  the  feasibility  expressions: 

ejhjv-ejkj(v0  +  £  X,.e,„.} 

'  /ea  > 

'  i  6  B  > 

1 

“yr‘ji>B(j)>o,  (a. 7) 

UB 

since  CB(j)  -  yes.  The  last  equality  is  obtained  by  expanding 
along  the  first  row  of  DB(j).  This  completes  the  proof  of  b).  By 
construction  the  algorithm  terminates  after  at  most  K-2  steps. 

In  part  A  of  the  algorithm  notice  that  n  ><■  0  corresponds  to  a 
solution  in  the  interior  of  the  feasible  cone,  with  all  X  equal  to 
zero,  and  v  -  v0/]f  c£Hvg .  The  corresponding  asymptotic  effi¬ 
ciency  r?(e)/wk  » t%Hca/wk  »•  ij,  which  is  equal  to  the  asymp- 
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totic  efficiency  of  the  maximum  likelihood  detector  as  given  by 
(2.11).  On  the  other  hand,  n  -1  corresponds  to  a  solution  on 
exactly  one  of  the  delimiting  hyperplanes,  with  exactly  one  X 
nonzero  (call  it  X;),  and 


and 


(A.8) 


The  feasible  set  in  (A.10),  F°  (c|Wo  =  (1  /vk)uk ),  is  nonempty 
(e.g.,  it  contains  the  set  ((1  /uk)h[,H' s  1(H))),  and  for 
all  oeF,  vk  =  jHkk .  Hence  tj(e)  =1  /\jHkk ,  and  with  (3.8), 
(1/h^ )tj2(«)  *=1/Rfk,  which  is  the  energy  independent  asymp¬ 
totic  efficiency  of  the  decorrelating  detector  for  independent 
users. 


References 


(A.9) 

H1j 

The  asymptotic  efficiency  achieved  in  this  case  is  bounded  above 
by  the  one  for  n  -  0,  since  the  second  term  is  nonnegative.  If  the 
matrix  H  does  not  have  a  lot  of  structure,  which  is  to  be  expected 
in  practical  applications,  this  is  the  most  probable  case.  For 
increasing  n  the  computational  effort  grows  fast,  but  in  most 
cases  the  algorithm  will  terminate  for  very  small  n. 

We  also  have  an  explicit  solution  for  the  “terminal  case,” 
n  -  K  - 1,  which  corresponds  to  the  decorrelating  detector  case. 
Then,  without  loss  of  generality,  v=*h'k  /\jHkk,  a  scaled  version 
of  the  Jkth  column  of  any  generalized  inverse  matrix  of  H  (in 
particular  of  H * )  and  rj(e)  — 1  /Hkk ,  which  is  equal  to  the  kth 
user  asymptotic  efficiency  of  the  decorrelating  detector,  when  the 
scaling  factor  l/tvA  of  (4.16)  is  taken  into  account.  This  can  be 
shown  as  follows.  In  the  terminal  case  hjv -  0,  for  all  j  *  k. 
Hence 

i)(e) -  max  cjHo-  max  h[v 

oG  Rk  oGR* 

c/fo-1  Ajo-0 

j+k 

J*k  ^Afe-I 

If  user  k  is  dependent,  Ho  -  0  and  ij(e)  =»  0.  Since  this  was  the 
best  choice  of  o,  we  can  without  loss  of  generality  replace  6  by 
the  kth  row  of  any  generalized  inverse,  because  the  resulting 
asymptotic  efficiency  cannot  become  negative.  If  user  k  is  inde¬ 
pendent,  Lemma  1  implies  HH'uk  -  uk,  and  for  all  v  in  the 
feasible  set, 

,  1  , 
tf'tfo-— HV 
vk 

Hence  using  Lemma  1  for  both  equations,  we  obtain 
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Computational  Complexity  of  Optimum  Multiuser  Detection1 

Sergio  Verdu2 

Abstract.  Optimum  centralized  demodulation  of  the  independent  data  streams  transmitted  simul¬ 
taneously  by  several  users  through  a  Code  Division  Multiple-Access  channel  is  considered.  Each 
user  sends  an  arbitrary  assigned  signal  waveform,  which  is  linearly  modulated  by  symbols  drawn 
from  a  finite  alphabet.  If  the  users  are  asynchronous,  the  optimum  multiuser  detector  can  be 
implemented  by  a  Viterbi  algorithm  whose  time-complexity  is  linear  in  the  number  of  symbols 
transmitted  by  each  user  and  exponential  in  the  number  of  users.  It  is  shown  that  the  combinatorial 
problem  of  selecting  the  most  likely  transmitted  data  stream  given  the  sufficient  statistics  (sequence 
of  matched  filter  outputs),  and  the  signal  energies  and  cross-correlations  is  nondeterministic  poly¬ 
nomial-time  hard  (NP-hard)  in  the  number  of  users.  And  it  remains  so  even  if  the  users  arc  restricted 
to  be  symbol-synchronous. 

The  performance  analysis  of  optimum  multiuser  detection  in  terms  of  the  set  of  multiuser  asymptotic 
efficiencies  is  equivalent  to  the  computation  of  the  minimum  Euclidean  distance  between  any  pair 
of  distinct  multiuser  signals.  This  problem  is  also  shown  to  be  NP-hard  and  a  conjecture  on  a 
longstanding  open  problem  in  single  user  data  communication  theory  is  presented. 

Key  Words.  NP-complete,  Hypothesis  testing,  Code  Division  Multiple  Access,  Gaussian  communica¬ 
tion  channels,  Maximum-likelihood  sequence  detection. 

1.  Introduction.  The  purpose  of  hypothesis  testing  problems  is  to  select  a  so¬ 
lution  (decision)  from  among  a  finite  set  of  possible  solutions  (hypotheses). 
Typically,  the  number  of  hypotheses  is  small,  in  which  case  the  inherent  com¬ 
binatorial  optimization  nature  of  the  problem  does  not  play  any  role  and  the 
main  question  is  to  obtain  the  values  of  the  likelihood  function  or  other  finite- 
dimensional  set  of  sufficient  statistics.  In  this  paper  we  study  a  data  demodulation 
problem  where  the  reverse  situation  is  encountered:  it  is  straightforward  to  obtain 
a  set  of  scalar  sufficient  statistics  but  the  number  of  hypotheses  is  very  large. 

An  important  problem  arising  in  multipoint-to-point  digital  communication 
networks  (e.g.,  radio  networks,  local-area  networks,  and  uplink  satellite  channels) 
is  the  optimum  centralized  demodulation  of  the  information  sent  simultaneously 
by  several  users  through  a  Gaussian  multiple-access  channel.  Even  though  the 
users  may  not  employ  a  protocol  to  coordinate  their  transmission  epochs,  effective 
sharing  of  the  channel  is  possible  because  each  user  modulates  a  different 
signature  signal  waveform  which  is  known  by  the  intended  receiver  (Code 
Division  Multiple  Access  (CDMA)).  Recently  [1],  optimum  multiuser  detection 
has  been  shown  to  offer  important  gains  in  bit-error-rate  performance  over 
single-user  detectors,  which  are  conventionally  used  in  practice  and  neglect  the 
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presence  of  interfering  users.  The  optimum  multiuser  receiver  can  be  viewed  as 
a  bank  of  single-user  detectors  followed  by  a  common  algorithm  that  selects  the 
most  likely  transmitted  symbols.  The  structure  of  this  algorithm  depends  crucially 
on  whether  or  not  the  users  maintain  symbol  synchronism.  In  the  synchronous 
case,  it  is  enough  to  maximize  a  quadratic  function,  while  in  the  asynchronous 
case  the  way  to  get  rid  of  the  interference  among  the  users  is  to  employ  a  version 
of  the  Viterbi  forward  dynamic  programming  algorithm  [2],  whose  time- 
complexity  is  exponential  in  the  number  of  active  users.  It  is  shown  in  this  paper 
that  the  problem  is  nondeterministic  polynomial-time  hard  (NP-hard)  in  the 
number  of  users,  and  hence  there  exists  no  polynomial-time  algorithm  for 
optimum  multiuser  detection  unless  such  an  algorithm  is  found  for  a  large  class 
of  combinatorial  problems,  such  as  the  traveling  salesman  and  integer  linear 
programming  problems.  It  is  shown  that  the  problem  remains  NP-hard  even  in 
the  synchronous  case  despite  an  earlier  claim  of  existence  of  polynomial  solutions 
[3]  for  that  case. 

In  Section  2  the  multiple-access  channel  model  and  the  maximum-likelihood 
detection  problem  are  formulated  and  it  is  shown  that  optimum  multiuser  detec¬ 
tion  is  NP-hard  in  the  number  of  users.  In  Section  3  it  is  shown  that  the 
performance  analysis  of  optimum  multiuser  detectors  is  intrinsically  difficult  due 
to  the  fact  that  the  computation  of  the  minimum  distance  between  any  pair  of 
distinct  multiuser  signals  is  also  NP-hard.  Finally,  Section  4  summarizes  the  main 
points  of  the  paper,  discusses  suboptimum  alternatives,  and  presents  a  conjecture 
on  a  longstanding  open  problem  in  data  communication  theory. 


2.  Optimum  Multiuser  Detection.  Assume  each  of  K  users  transmits  indepen¬ 
dent  symbols  by  modulating  a  preassigned  waveform  from  a  signal  constellation 
{sit(0,  1 6  [0,  T],  k-  1,  •  •  • »  &}•  If  the  users  cooperate  to  maintain  symbol  syn¬ 
chronism,  the  receiver  observes  the  sum  of  the  modulated  signals  imbedded  in 
noise,  i.e., 

(1)  r(0=  I  bksk(t)  +  n(t), 

k-\ 


where  the  symbols  bk,  k  =  1, . . . ,  K,  are  drawn  by  each  user  from  a  finite  alphabet 
A.  A  reasonable  decision  rule  is  to  select  the  set  of  symbols  corresponding  to 
that  signal  among  the  possible  ones  that  resembles  most  closely  (in  a  mean-square 
sense)  the  received  waveform.  If  the  noise  is  Gaussian  and  white,  then  this  rule 
is  optimum  in  the  maximum-likelihood  sense.  If,  furthermore,  all  vectors  b  = 
{bu . . . ,  bK)e  Ak  are  a  priori  equiprobable,  then  the  minimum  distance  rule  gives 
the  maximum- a-posteriori  (MAP)  decision.  In  the  single-user  case,  this  detector 
is  implemented  by  comparing  the  output  of  a  matched  filter  with  a  set  of 
thresholds.  Analogously,  in. the  multiuser  problem  we  have 


(2) 
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where  y  =  O',, . . .  ,yK),  yk  =  \l sk(t)r(t)  dt,  i.e.,  yk  is  the  output  of  the  matched 
filter  of  the  fcth  user  signal  and  the  entries  of  the  nonnegative  definite  matrix  H 
are  given  by 


(3) 


h,j  = 


rr 


si{t)sj(t)  dt. 
o 


If  the  signal  waveforms  are  orthogonal,  then  H  is  a  diagonal  mrtrix  and  the 
maximization  in  (2)  decouples  into  K  single-user  problems.  Otherwise  (in  prac¬ 
tice,  there  may  be  bandwidth  or  complexity  constraints  that  prevent  the  designer 
from  choosing  an  orthogonal  signal  set),  a  combinatorial  algorithm  is  required 
’  to  solve  the  quadratic  optimization  (2)  over  the  finite  set  AK,  given  the  vector  of 
sufficient  statistics  y  and  the  signal  cross-correlations  H.  Since  the  set  of  quantities 
{bry,  be  A*}  can  be  computed  in  0(|A|K)  operations,  an  upper  bound  on  the 
time-complexity  per  bit  (TCB)3  required  to  solve  (2)  is  0(|A|K/ K  log| A|).  This 
is  the  best  available  upper  bound;  in  [3]  it  is  claimed  that  a  receiver  whose 
complexity  is  polynomial  in  the  number  of  users  (basically,  if  A  =  { — 1,  +1},  select 
the  sign  of  the  components  of  H"‘y)  is  optimal.  Unfortunately,  this  claim  is 
erroneous;  the  mistake  in  the  derivation  of  the  detector  is  commited  in  equation 
4  of  [3]  where  it  is  implicitly  assumed  that  the  symbols  put  out  by  the  detector 
are  uncorrelated  with  the  noise  component  of  the  matched  filter  outputs. 

More  significant  in  practical  applications  is  the  case  where  the  users  are 
mutually  asynchronous,  and  indeed  one  of  the  chief  advantages  of  CDMA 
over  other  channel  sharing  strategies  is  that  no  type  of  coordination  among  the 
users  is  required.  Now,  however,  (1)  is  no  longer  a  valid  model.  The  delays 
{rk,k  =  1, . . . ,  1C}  account  for  the  offsets  between  the  signaling  epochs  and  (1) 
has  to  be  generalized  to 

(4)  r(t)  =  £  £  bk(i)sk(t-iT-Tk)  +  n(t), 

i--M  fc-l 


where  by  convention  sk(t)  =  0  for  ft£[0,  T].  Now  we  can  no  longer  restrict  our 
attention  to  the  one-shot  case  because  optimum  decisions  are  based  on  the  whole 
received  waveform  due  to  the  interference  between  the  symbols.  The  optimum 
receiver  [1]  for  the  asynchronous  case  in  the  sense  of  selecting  the  most  likely 
sequence  of  symbols  consists  of  a  front-end  of  matched  filters  (just  as  in  the 
synchronous  case)  followed  by  a  Viterbi  dynamic  programming  algorithm  with 
|A|k_i  states  and  a  periodically  time-varying  branch  metric.  The  TCB  of  this 
decision  algorithm  is  0(|A|K/log|A|),  and  hence  the  penalty  in  time-complexity 
due  to  the  lack  of  synchronism  between  the  users  is  slight.  The  usefulness  and 
relevance  of  the  computational  complexity  results  proved  in  this  paper  stem  from 
the  fact  that  when  the  users  are  asynchronous,  the  cross-correlations  between 


JThe  time-complexity  per  bit  is  defined  as  the  limit  of  the  ratio  of  total  time  to  the  number  of 
demodulated  bits  as  this  goes  to  infinity.  Note  that  any  preprocessing  of  the  signal  cross-correlations 
does  not  affect  TCB. 
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their  signals  are  unknown  a  priori,  and  the  worst-case  TCB  over  all  possible 
mutual  offsets  is  the  complexity  measure  of  interest  since  it  determines  the 
maximum  achievable  data  rate  in  the  absence  of  synchronism  among  the  users. 
Actually,  no  family  of  signature  signals  is  known  to  result  in  optimum  demodula¬ 
tion  with  polynomial-in-K  complexity  for  all  possible  signal  offsets.  So  even  if 
the  designer  of  the  signal  constellation  were  to  include  in  his  design  criterion  the 
complexity  of  the  optimum  demodulator  in  addition  to  the  bit-error-rate  perform¬ 
ance  (which  dictates  signals  with  low  cross-correlations),  he  would  not  be  able 
to  endow  the  signal  set  with  any  structure  that  would  overcome  the  inherent 
intractability  of  the  optimum  asynchronous  demodulation  problem  for  all  possible 
offsets.  On  the  other  hand,  in  the  synchronous  case,  the  designer  of  the  signal 
constellation  has  more  control  on  the  cross-correlations  (subject  to  constraints 
such  as  bandwidth  or  number  of  chips  per  symbol  in  Direct-Sequence  Spread- 
Spectrum),  and  it  is  conceivable  that  there  exist  synchronous  signal  design 
constraints  that  result  in  families  of  signature  signals  whose  structure  can  be 
exploited  to  result  in  optimum  polynomial-time  decision  algorithms.  This  is  the 
reason  why  the  computational  complexity  results  of  this  paper  appear  to  be  more 
relevant  to  asynchronous  channels  even  though  for  the  purposes  of  the  proofs 
in  our  lower  bound  analysis  we  may  restrict  attention  to  the  special  case  where 
all  the  delays  coincide  (t,  =  •  •  •  =  tk),  because  the  optimum  multiuser  detector 
must  be  able  to  deal  with  any  arbitrary  set  of  delays. 

In  order  to  ascertain  that  the  intractability  of  the  optimum  multiuser  problem 
arises  when  the  number  of  users  is  large  and  the  alphabet  size  is  kept  constant, 
we  first  fix  an  arbitrary  alphabet  A  =  {a,, . . . ,  am}  (which  is  a  set  of  integers 
satisfying  a,  <al+l),  and  define  a  class  of  instances  of  the  combinatorial  optimi¬ 
zation  problem  for  that  fixed  A* 

MULTIUSER  DETECTION 

Instance:  Given  KeZ+,yeQK,  and  a  nonnegative  definite  matrix  He  QKxK 
Find :  {b*eAK}  that  maximizes  2bry-brHb. 

Proposition  1.  If  |A|>  1,  then  MULTIUSER  DETECTION  is  NP -hard. 

Proof.  The  proof  of  NP-hardness  of  MULTIUSER  DETECTION  can  be 
carried  out  by  transformation  from  PARTITION,  an  NP-complete  recognition 
problem.  Recall  its  definition  [4]: 

Instance :  Given  Le  Z+  and  {/,  e  Z+,  /=  1, . . . ,  L}. 

Question:  Is  there  a  subset  Jc  {l, . . . ,  L)  such  that  £i6/  4 -I, .,4? 

For  each  instance  cf  PARTITION,  we  can  find  in  polynomial  time  an  instance 
of  MULTIUSER  DETECTION  whose  solution  can  in  turn  be  processed  in 


*  Note  that  since  the  alphabet  A  is  not  part  of  the  instance,  if  a  specific  A  is  assumed,  then  the 
corresponding  NP-hard  result  is  a  corollary  to  Proposition  1.  Actually,  for  A  =  {-1,+1},  the  proof 
of  Proposition  1  can  be  simplified  considerably  by  letting  htJ  =  If  and  y*  =  0  therein. 
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polynomial  time  to  give  an  answer  to  PARTITION.  Given  lu...,lL,v/e  choose 
the  following  instance  of  MULTIUSER  DETECTION: 


K  =  L, 

hi]  —  lflj,  i*j, 


hu  =  [/j  max  j/(,  (a2-  a,)_,[2flm  -  a,  -  a2]  £  (,  j 

J+i 


i  =  K, 


yk =2(01+02) 


^  hkk  +  Ik  I  Ijj  > 

J*k 


k  =  l,...,K. 


Note  that  this  is  a  valid  instance  of  MULTIUSER  DETECTION,  because  H 
is  a  nonnegative  definite  matrix.  Once  the  solution  to  this  instance  of  MULTI¬ 
USER  DETECTION  is  found,  we  can  find  the  solution  to  the  original  instance 
of  PARTITION,  because  {/i,...,4}  is  a  yes  instance  of  PARTITION  if  and 
only  if 

(5)  max2bTy-bTHb=[ 5(0, +  a2)  £  /,]  +axa2  £  (/»„-/*). 

be*K  L  (-1  J  l-t 

Equation  (5)  can  be  shown  by  changing  the  variable  in  the  left-hand  side  of  (5) 
b  =  ^(a2  —  a,)z-t-5(al  +  a2)  and  proving  that  it  is  enough  to  restrict  attention  to  the 
values  z(  =  ±  1  in  the  maximization  in  (5)  (see  [5]  for  details).  □ 

The  foregoing  proof  shows  that  the  same  transformation  works  if  the  value  of 
the  diagonal  elements  of  H  is  arbitrarily  increased.  Hence  MULTIUSER  DETEC¬ 
TION  remains  NP-hard  if  H  is  restricted  to  be  strongly  diagonal  (an  important 
special  case  in  CDMA  with  equal-energy  users).  Note  that  MULTIUSER 
DETECTION  was  defined  for  a  fixed  arbitrary  alphabet.  Thus,  Proposition  1 
implies  that  the  problem  is  inherently  difficult  when  the  number  of  users  is  large, 
regardless  of  the  alphabet  size  (often  a  small  integer).  Conversely,  in  Section  2 
w*»  -aw  that  the  problem  is  polynomial  in  the  alphabet  size  for  fixed  number  of 


3.  NP-hardness  of  Multiuser  Asymptotic  Efficiency.  In  this  section  we  examine 
the  complexity  of  the  performance  analysis  of  optimum  multiuser  detection.  The 
purpose  of  this  analysis  is  to  evaluate  the  effect  of  the  energies  and  cross¬ 
correlations  of  the  signal  constellation  on  the  bit-eiror-rate  of  the  receiver  for  an 
arbitrary  level  of  background  noise.  It  has  been  shown  [6]  that  the  key  perform¬ 
ance  measure  is  the  multiuser  asymptotic  efficiency,  or  ratio  b  tween  the  exponen¬ 
tial  decay  rate  of  the  bit-error-rates  with  and  without  interfering  users.  This 
parameter  effectively  quantifies  the  degradation  in  bit-error-rate  due  to  the  pres¬ 
ence  of  other  users,  in  situations  where  the  background  noise  is  not  dominant. 
The  asymptotic  efficiency  of  the  fcth  user,  rjk,  is  proportional  to  the  Euclidean 
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distance  between  any  pair  of  transmitted  signals  whose  fcth  symbols  do  not  agree 
[6j.  Specifically,  assuming  synchronous  users  and  antipodal  modulation  (i.e., 
A  =  {-1,  +1}),  the  fcth  user  asymptotic  efficiency  can  be  expressed  as 


(6) 


min 
bl*bi  J 

*  T 

0 

l  i 

/ - s 

•*-»  1 

jX 

M  — 

-o 

1 

i _ j 

2 

dt 

4 

s2k(t)  dt 

0 

hkk 


min 

«6(-  1.0.1} 
tt*0 


KeTHe. 


Proposition  2.  The  following  problem  is  UP-hard: 

MULTIUSER  ASYMPTOTIC  EFFICIENCY 

Instance:  Given  KeZ+,  fce{l, . . . ,  K},  and  a  nonnegative  matrix  HeZKxK 
Find:  The  fcth  user  maximum  asymptotic  efficiency, 

Vk  =  (1/Aw)  minee(_l>0>„*,,*fo  £tHe. 

Proof.  The  proof  is  divided  in  two  steps,  first  -1/0/1  KNAPSACK  is  poly¬ 
nomial^  transformed  to  MULTIUSER  ASYMPTOTIC  EFFICIENCY,  and  then 
we  show  that  -1/0/1  KNAPSACK  is  NP-complete.  In  analogy  to  the  0/1 
KNAPSACK  problem  (e.g.,  [7])  we  define 


-1/0/1  KNAPSACK 

Instance :  LeZ+,  GeZ+,  and  a  family  of  not  necessarily  distinct  positive  integers 

{/,eZ+, 

Question:  Are  there  integers  e,  e  {-1, 0, 1},  i  -  1, . . . ,  L,  such  that  e,l,  =  G? 


We  transform  -1/0/1  KNAPSACK  to  MULTIUSER  ASYMPTOTIC 

EFFICIENCY  by  adding  an  additional  user.  Given  {G,l . /J,  denote  4+,= 

G  and  construct  the  following  instance:  K  =  L+ 1,  fc  =  L+ 1,  h,j  =  If,  1  s  i,j  £  K. 

The  Kth-user  asymptotic  efficiency  is  equal  to  0  if  and  only  if  {G,  /,,... ,  lL} 
is  ayes  instance  of -1/0/1  KNAPSACK.  To  see  this,  note  that  we  can  fix  ek  =  -1 
in  the  right-hand  side  of  (6)  without  loss  of  generality.  Then, 


(7) 


Vk  = 


— ;  min 
G  *<€{—i,o,i) 
islsK-l 


SKK 


k- i  r  k-i 

■  1  en  -2 hnK  +  l  emhn 

n  1  L  m*  1 


i  (  K-'  V 

=  min  G-  £  enl„\ 


lS/SK-l 


Now  we  show  that  —1/0/1  KNAPSACK  is  NP-complete.  Its  membership  in 
NP  is  obvious.  Note  that  it  is  easy  to  transform  -1/0/1  KNAPSACK  to  0/1 
KNAPSACK  ({G,  lt, ... ,  lL)  is  a  yes  instance  of  -1/0/1  KNAPSACK  if  and 
only  if  {G,  ,  lL,  -/*.}• is  a  yes  instance  of  0/1  KNAPSACK).  However, 

we  need  to  show  the  reverse  transformation,  namely,  fixing  any  instance  of  0/1 
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KNAPSACK  obtain  an  equivalent  instance  of  -1/0/1  KNAPSACK.  The  idea 
is  reminiscent  of  the  polynomial  transformation  of  0/ 1  KNAPSACK  to  POSITIVE 
INTEGER  KNAPSACK  (see  p.  376  of  [7]),  and  it  consists  of  constructing  an 
augmented  instance  of  -1/0/1  KNAPSACK  whose  integers  are  large  enough  to 
force  every  coefficient  to  be  either  0  or  1.  Choose  an  instance  {G,lu...,lL}  of 
0/1  KNAPSACK,  and  construct  the  following  instance  of  -1/0/1  KNAPSACK 
{A  P\y  •  •  >  P2L.}- 


D  =  G+£  M1, 

t 


Pj  = 


for 

for 


M  =  1  +  max 


1  s/asi, 

i+i<;==2i, 

G+Z  /,]. 

i-l  J 


Then  it  follows  from  this  choice  that  for  all  {e,  s  {-1, 0, 1},  1  s  f  <  2L} 


(8) 


Z  e,Pi-D  =  -G+Z  eJi+1  M‘(et  +  ef+n - 1). 


j«i 


i- 1 


It  is  straightforward  to  show  that  M  is  too  large  to  be  a  root  of  any  L-degree 
polynomial  Zt-oP‘x‘>  where  /3(e{-3, -2, -1,0, 1}  for  1  s/si  and  /306 
{-G+XrL_,  ei^y  £‘e  {“1. 0. 1}»  1  Q-  Hence,  (8)  is  equal  to  zero  if  and  only 
if  all  the  coefficients  of  the  polynomial  in  M  of  the  right-hand  side  are  zero,  i.e., 

2  L  L 

Z£iPi-D  Z  £Ji-G  and  {er  =  0,  ef+n  =  1  or  =  1,  ei+n  =  0, 1  £  /}. 

i-i  i-i 

Therefore,  {G,  lL}  is  a  yes  instance  of  0/1  KNAPSACK  if  and  only  if 

{D,pi,. .  .,P2l)  is  a  yes  instance  of  -1/0/1  KNAPSACK.  □ 

Note  that  the  proof  of  Proposition  2  shows  in  fact  that  the  problem  of  deciding 
whether  a  signal  constellation  is  uniquely  decodable,  i.e.,  whether  different 
transmitted  bits  result  in  the  same  waveform,  is  NP-complete. 


4.  Concluding  Remarks,  Suboptimum  Algorithms,  and  Open  Problems.  It  has 
been  shown  that  the  problem  of  optimum  detection  in  Gaussian  multiple-access 
channels  is  NP-hard  in  the  number  of  users.  This  result  holds  for  any  nontrivial 
alphabet  even  if  the  channel  is  symbol-synchronous.  Exponential-in-K  optimum 
detectors  for  Poisson  multiple-access  channels  with  point-process  observations 
were  obtained  in  [8].  It  can  be  shown  that  this  problem  is  also  NP-hard  in  both 
the  additive-rate  and  additive-light  models  of  the  channel. 
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Not  ohly  is  the  optimum  decision  rule  intrinsically  difficult,  but  so  is  the  analysis 
of  its  performance  due  to  the  NP-hardness  of  the  computation  of  multiuser 
asymptotic  efficiencies.  It  should  be  pointed  out,  however,  that  in  many  instances 
of  signal  constellations  used  in  CDMA  Direct  Sequence  Spread-Spectrum  systems 
[9],  the  cross-correlations  are  low  enough  to  pass  sufficient  conditions  [6]  ensuring 
unit  asymptotic  efficiency  that  are  computable  in  quadratic  time  in  K.  So,  unlike 
the  situation  we  encountered  in  the  optimum  asynchronous  demodulation  prob¬ 
lem,  the  worst-case  complexity  measure  may  be  overly  pessimistic  for  specific 
instances  exhibiting  low  cross-correlations. 

What  alternatives,  then,  does  the  designer  have  when  the  number  of  users  is 
large?  The  suboptimum  solution  currently  employed  in  practice  is  the  bank  of 
single-user  receivers  (i.e.,  a  matched  filter  for  each  user  followed  by  a  threshold). 
Unfortunately,  this  scheme  achieves  far  from  optimum  bit-error-rate  and  its 
performance  breaks  down  when  the  signal  energies  are  dissimilar  (the  near-far 
problem)  [11.  [6],  [10].  Therefore,  the  search  for  approximation  algorithms  that 
achieve  nea  optimum  bit-error-rate  with  polynomial  complexity  appears  to  be 
an  open  resea  ;h  area  with  important  consequences  in  practice.  Numerical  results 
indicate  that  performance  extremely  close  to  the  single-user  lower  bound  is 
achievable  in  the  bit-error-rate  region  of  usual  interest  (10~4  or  less)  by  the 
maximum-likelihood  multiuser  receiver  even  if  the  cross-correlation  qualities  of 
signal  constellations  typically  used  in  practice  are  considerably  relaxed.  This  is 
a  sign  that  the  Viterbi-based  optimum  multiuser  receiver  possesses  an  important 
degree  of  redundancy  in  situations  with  good  signal  sets  and  low  background 
noise  and  hence  faster  decision  algorithms  achieving  similar  performance  are 
plausible.  Furthermore,  a  linear  multiuser  demodulator  whose  TCB  is  linear  in 
K  is  found  in  [10]  to  achieve  the  same  worst-case  asymptotic  efficiency  over  the 
energies  of  the  interfering  users5  as  the  optimum  demodulator.  While  for  specific 
values  of  the  received  energies,  its  asymptotic  efficiency  (and  hence,  its  bit  error 
rate)  need  not  be  close  to  the  optimum  one,  its  performance  is  guaranteed  to 
exceed  a  high  lower  bound  in  all  cases  of  practical  interest,  thus  making  this 
suboptimum  demodulator  an  attractive  choice  from  both  the  complexity  and 
performance  standpoints. 

Finally,  let  us  consider  an  interesting  special  case  of  the  asynchronous  optimum 
multiuser  detection  model  in  (4),  namely  the  single-user  intersymbol  interference 
problem, 


r(t)-  I  b(i)s(t - iT)  +  n(t), 

l—M  • 

where  the  duration  of  s(f)  is  greater  than  T.  In  general,  there  is  no  known  efficient 
method  to  obtain  the  most  likely  sequence  of  transmitted  symbols  given  the 
received  waveform  (TCB  is  exponential  in  the  frame  length  M).  However,  if  the 
number  of  signals  that  interfere  at  any  given  time  is  bounded  by,  say,  L,  then  it 


3  This  is  called  the  near-far  resistance,  a  key  measure  of  the  robustness  of  the  system  against  variations 
in  the  received  energies. 
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is  well  known  [11]  that  maximum-likelihood  sequence  detection  can  be  imple¬ 
mented  by  the  Viterbi  algorithm  in  TCB  which  although  exponential  in  L  is 
independent  of  M.  Despite  many  efforts  (e.g.,  [12]-[14])  motivated  by  the  impor¬ 
tance  of  this  problem  in  the  area  of  data  transmission  through  bandlimited 
channels,  no  polynomial-in-L  algorithm  for  maximum-likelihood  sequence  detec¬ 
tion  is  known.  This  fact  and  the  results  of  this  paper  lead  us  to  suspect  that  we 
may  be  facing  another  NP-hard  problem.  In  fact  following  the  same  steps  as  in 
Section  2,  it  can  be  seen  that  the  most  likely  sequence  of  symbols  corresponding 
io  (13)  is  the  one  that  maximizes  2bTy-bTHb,  where  b  =  (6(-M),  ...,6(M)), 
y  =  (y(-M),  ...,y(M)),  y(i)  =  js(t-iT)r(t)dt,  and  H  is  the  nonnegative 
definite  Toeplitz  matrix  (i.e.,  constant  along  diagonals)  with  entries  given  by 
h,j  =  J  s(t  -  iT)s(t  -jT)  dt.  Hence  if  we  specialize  L  =  2M  + 1  =  K,  the  underlying 
combinatorial  problem  coincides  with  MULTIUSER  DETECTION  with  an 
additional  restriction  on  the  data: 

Conjecture  1.  MULTIUSER  DETECTION  remains  NP-hard  if  H  is  restricted 
to  be  Toeplitz. 

Indeed,  it  appears  that  the  Toeplitz  condition  imposes  an  analytically  incon¬ 
venient  restriction  on  the  set  of  allowable  instances.  A  possible  route  is  to  consider 
the  following  restricted  version  of  :he  problem. 

FIR 

Instance:  Given  LeZ+,  Ee Z+,  and  the  coefficients  of  a  finite-impulse  response 
(FIR)  digital  filter  of  length  L  (h,eZ,  i  =  0, . .. ,  L- 1). 

Question:  Does  there  exist  an  input  sequence  (6,6  {-1,  +1},  i  =  0, . . . ,  L- 1)  such 
that  the  output  energy  of  the  FIR  is  less  than  E,  i.e., 

1L- 2  /L-l  \2 

l  l  bjhH)  <£? 

(-0  \)-0  / 

Conjecture  2.  FIR  is  NP-complete. 

Similarly,  the  problem  of  the  performance  analysis  of  the  single-user  intersym¬ 
bol  interference  channel  is  equivalent  to  finding  the  minimum  distance  between 
any  pair  of  transmitted  data  streams.  This  problem  for  which  no  polynomial 
algorithm  in  the  length  of  the  interference  is  known  can  be  put  as  a  special  case 
of  MULTIUSER  ASYMPTOTIC  EFFICIENCY  and  it  is  not  known  whether  it 
is  NP-hard. 
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ABSTRACT 


In  this  paper  we  consider  a  noncoherent,  optical,  asynchronous,  code  division  multiple 
access  ( CDMA)  system.  We  present  a  semi-classical  analysis  of  the  error  rate  for  a  single- 
user,  matched-filter  receiver  that  applies  for  arbitrary  photomultipliers  and  signature 
sequence  sets,  adheres  fully  to  the  semi-classical  model  of  light,  and  does  not  depend  on 
approximations  for  large  user  groups,  strong  received  optical  fields,  or  chip  synchronism. 
We  compare  the  exact  minimum  probability  of  error  and  optimal  threshold  to  those 
obtained  with  popular  approximations  on  user  synchronism  or  on  the  distribution  of  the 
multiple  access  interference  (MAI).  For  the  special  case  of  unity-gain  photodetectors  and 
prime  sequences,  we  show  that  the  approximation  of  chip  synchronism  yields  a  weak 
upper  bound  on  the  exact  error  rate.  We  demonstrate  that  the  approximations  of  perfect 
optical-to-electrical  conversion  and  Gaussian-distributed  MAI  yield  a  poor  approximation 
to  the  minimum  error  rate  and  an  underestimate  of  the  optimal  threshold.  In  this  paper 
we  also  develop  arbitrarily-tight  bounds  on  the  error  rate  for  unequal  energies  per  bit.  In 
the  case  when  the  signal  energies  coincide,  these  bounding  expressions  are  considerably 
easier  to  compute  than  the  exact  error  rate. 


1.  Introduction 

Several  users  may  independently  access  a  common  communication  channel  using  code  division  mul¬ 
tiple  access  (CDMA)  modulation.  This  multiaccess  scheme  does  not  use  time-  or  frequency-allocation, 
and  users  may  transmit  without  the  delays  inherent  to  multiaccess  protocols.  In  the  direct-detection 
optical  CDMA  channel,  interference  immunity  is  achieved  by  the  assignment  of  rapidly  varying,  on- 
off  waveforms.  These  waveforms,  or  signature  sequences,  are  modulated  by  the  data  of  each  user  and 
concentrate  the  transmitted  energy  into  relatively  short  time  intervals  in  each  symbol  period.  The  trans¬ 
mitted  signals  from  the  users  are  then  combined  on  a  common  optical  fiber.  Single-user  demodulation 
is  (suboptimally)  achieved  by  correlating  the  aggregate  signal  and  the  signature  sequence  of  the  desired 
user.  As  a  result,  the  correlator  output  is  the  desired  signal  in  additive  interference,  which  is  reduced 
through  the  use  of  signature-sequence  sets  with  low  cross-correlation.  A  correlator  receiver  of  this  type 
must  know  only  the  timing  epoch  of  the  desired  user,  and  a  common  timing  reference  need  not  be  sent 
to  all  transmitters. 

In  this  paper  we  present  the  error  rate  of  a  particular  single-user  receiver  in  the  noncoherent  CDMA 
optical  fiber  channel.  This  receiver  has  been  the  focus  of  previous  analyses  and  local  area  network  pro¬ 
totypes  [1-3].  In  contrast  to  previous  efforts  we  have  avoided  making  approximations  to  three  analytical 
obstacles.  First,  our  analysis  retains  the  quantized  nature  of  electromagnetic  radiation.  While  the  par¬ 
ticle  nature  of  radiation  may  be  neglected  for  the  analysis  of  microwave  communication  systems,  optical 

f  This  work  was  supported  by  the  U.S.  Army  Research  Office  under  Contract  DAAL03-87-K-0062. 
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.''nrr-mation  of  a  surface  is  most  accurately  described  by  a  photon  arrival  process,  a  doubly-stochastic 
point  process  whose  random  rate  is  proportional  to  the  intensity  of  the  electromagnetic  field  integrated 
over  the  illuminated  surface.  In  particular,  this  holds  for  field  intensities  typical  of  received  signals  in 
direct-detection  optical  systems  [4-6].  As  might  be  expected,  it  is  convenient  to  analyze  receivers  based 
on  observations  of  the  received-field  intensity,  rather  than  on  observations  of  a  point  process  driven  by 
this  intensity.  This  approximation  is  equivalent  to  neglecting  the  quantization  of  energy  at  the  pho¬ 
todetector  screen,  and  is  increasingly  inaccurate  as  the  received  optical  power  decreases.  Second,  we 
have  avoided  the  use  of  a  Gaussian  approximation  of  the  MAI  distribution.  Due  to  low-weight  signature 
sequences,  the  MAI  is  additively  composed  of  cross-correlations  that  are  usually  limited  to  a  few  chips 
of  coincidence,  and  its  maximum  is  too  small  to  apply  central-limit-theorem  arguments.  As  might  be 
expected,  the  derivation  of  the  optimal  hypothesis  test  ^.id  the  error  analysis  are  complicated  by  the 
nature  of  the  MAI  distribution  and  would  be  simplified  if  the  MAI  had  Gaussian  statistics.  Third, 
we  have  avoided  a  chip-synchronous  approximation  on  the  relative  delays  between  the  users,  in  which 
case  the  relative  delays  are  integer  multiples  of  the  chip  interval.  Since  CDMA  is  a  form  of  multiuser 
modulation  without  transmission  coordination,  the  relative  delay  between  any  two  users  is  uniformly 
distributed  on  the  symbol  period.  The  relative  delays  affect  the  distribution  of  the  MAI,  which  would 
be  simplified  under  the  approximation  of  chip  synchronism. 

Previous  work  has  addressed  the  error  rate  of  optical  CDMA  receivers  through  the  use  of  these 
simplifying  approximations.  A  CDMA  optical  receiver  using  a  post-photodetection  (electrical)  matched 
filter  and  Gold  sequences  was  analyzed  in  [1],  The  receiver  decides  for  the  data  of  the  user  of  interest 
based  on  an  observation  of  the  optical  intensity,  rather  than  on  a  filtered  point  process  driven  by  the 
intensity.  This  approximation  is  known  as  “perfect  optical-to-electrical  conversion,”  and  ignores  the 
quantized  nature  of  lighu.  Chip-synchronous  transmission  was  also  assumed,  and  the  number  of  users 
was  considered  large  enough  to  model  the  MAI  as  a  Gaussian  random  variable.  Finally,  the  dark  current 
from  the  photodetector  was  ignored.  A  noncoherent,  optical  matched-filter  CDMA  receiver  employing 
prime  codes  was  analyzed  in  [2].  This  receiver  was  not  limited  by  the  speed  of  electronic  processing  as 
in  [1],  since  the  matched-filter  operation  was  performed  optically.  The  authors  assumed  perfect  optical- 
to-electrical  conversion  and  Gaussian-distributed  MAI.  With  these  approximations  the  observation  is  a 
deterministic  signal  in  Gaussian  noise,  and  the  authors  demonstrated  the  superior  performance  of  prime 
codes  over  Gold  codes  by  a  comparison  of  the  signal-to-noise  ratios. 

Recently,  a  two-part  paper  explored  the  performance  of  an  optical  CDMA  system  using  the  optical 
matched-filter  receiver  [7,8].  The  authors  computed  upper  and  lower  bounds  to  the  single-user  error  rate 
for  those  signature  sequences  whose  periodic  cross-correlations  are  limited  to  one  chip  of  coincidence. 
The  analysis  ignores  dark  current  and  the  quantized  nature  of  light.  With  these  approximations  the 
observation  is  composed  of  the  desired  user’s  energy  in  the  additive,  aggregate  interference,  and  the 
receiver  compares  this  statistic  to  a  threshold  in  order  to  decide  on  the  transmitted  information.  Error 
rate  bounds  were  obtained  by  considering  bounds  on  the  variance  of  the  single-user  interference.  Since 
the  bounds  on  the  variance  of  each  interferer  were  independent  of  the  corresponding  signature  sequences, 
the  authors  avoided  the  need  to  compute  all  interference  distributions.  The  upper  bound  to  the  variance 
was  given  by  the  chip-synchronous  interference  distribution,  and  the  lower  bound  followed  from  an  ideal 
set  of  codes  in  which  the  cross-correlations  were  strictly  less  than  one  chip  of  coincidence.  The  error  rate 
bounds  were  compared  as  a  function  of  the  threshold,  and  differed  by  more  than  2  orders  of  magnitude 
for  most  thresholds  and  numbers  of  transmitters  of  interest.  Upper  bounds  were  also  computed  for  the 
error  rate  when  the  matched-filter  receiver  is  preceded  by  an  optical  hard  limiter.  An  ideal  optical  hard 
limiter  is  a  nonlinear  device  which  blocks  the  incident  light  for  an  input  intensity  below  a  fixed  minimum 
value,  and  otherwise  limits  the  output  intensity  to  this  minimum  value.  When  the  threshold  of  the  hard 
limiter  is  set  to  the  intensity  of  a  single-user,  the  presence  of  the  desired  signal  may  be  known  exactly, 
as  before.  In  the  absence  of  the  desired  signal,  the  intensity  of  the  aggregate  interference  is  reduced 
and  bounded  by  that  of  a  single  user.  In  this  way,  an  optical  hard  limiter  reduces  the  error  rate  by 
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clipping  the  aggregate  interference' intensity  due' to  the  overlap  of  two  of  more  users’  waveforms.  Due 
to  the  nonlinear  operation  of  the  hard  limiter,  though,  the  output  intensity  is  no  longer  the  sum  of 
independent  signals,  and  an  exact  erro'  analysis  is  complicated  by  this  nonlinear  transformation.  The 
authors  derived  an  upper  bound  to  the  error  rate  by  overestimating  the  probability  of  interference  in  each 
non-zero  chip  of  the  desired  signature  sequence.  This  probability  was  bounded  by  randomly  selecting 
one  of  the  interferes  in  each  non-zero  •'hip  to  represent  the  interference,  and  the  remaining  users  were 
permitted  to  interfere  in  the  next  chip  position.  Thus,  each  user  could  interfere  repeatedly  until  he  is 
chosen  to  represent  the  interference  in  a  particular  chip.  This  work  compared  the  upper  bounds  to  the 
error  rate  of  the  matched-filter  CDMA  receiver  with  and  without  an  optical  hard  limiter,  and  showed 
that  many  more  users  may  be  accommodated  with  the  incorporation  of  the  optical  hard  limiter.  In  both 
upper  bounds,  the  convolution  of  three  of  more  iid  uniform  random  variables  on  [0, 1]  was  approximated 
by  the  Gaussian  distribution. 

An  artifact  of  perfect  optical-to-electrical  conversion  is  “error-free”  transmission,  which  occurs  for  a 
sufficiently  small  population  of  interferers.  Under  this  approximation  the  observable  is  the  total  received 
energy  in  the  symbol  interval,  which  is  additively  composed  of  the  desired  user’s  energy  and  that  of  the 
interferers.  Since  the  MAI  has  a  maximum  value  which  is  proportional  to  the  number  of  interferers,  one 
may  completely  separate  the  ranges  of  the  observable  under  each  hypothesis  if  the  number  of  interferers  is 
small  enough,  and  may  specify  an  error-free  threshold  test.  When  the  random  nature  of  the  photodetector 
output  current  is  retained,  this  effect  is  lost. 

In  summary,  recent  analyses  of  noncoherent,  optical  CDMA  receivers  have  relied  on  the  approx¬ 
imations  of  perfect  optical-to-electrical  conversion,  G  "ssian-distributed  MAI,  and  chip  synchronism. 
The  accuracy  of  these  approximations  is  not  known,  riow  much  do  the  approximations  of  Gaussian- 
distributed  MAI  and  perfect  optical-to-electrical  conversion  change  the  error  rate?  Does  the  approxima¬ 
tion  of  chip  synchronism  yield  an  error  rate  that  is  close  to  the  exact  error  rate?  How  small  is  the  exact 
error  rate  when  the  “error-free”  condition  is  satisfied? 

In  this  work  we  derive  the  exact  error  rate  for  the  icoherent,  optical  matched-filter  CDMA 
receiver,  which  decides  for  the  data  of  a  single  user  by  comparing  a  photoelectron  count  to  a  threshold. 
The  results  adhere  fully  to  the  semi-classical  model  of  light  and  do  not  depend  on  limit  theorems  for 
large  user  groups  or  strong  received  optical  fields.  The  analysis  is  valid  for  arbitrary  quantum  efficiencies, 
binary  signature  sequences,  random  gain  distributions,  and  dark  currents,  and  is  broad  in  app’  tion. 
In  Section  2  we  describe  noncoherent,  optical  CDMA  modulation  and  consider  single-user  demc  ,tion 
based  on  a  conditionally  compound-Poisson  observation.  We  derive  the  probability  mass  function  MF) 
of  the  observation  under  each  hypothesis  in  Section  3,  and  use  them  to  determine  the  optimal  oshold 
and  minimum  probability  of  error.  What  makes  the  analysis  particularly  interesting  is  the  fact  that 
while  the  formal  representation  of  the  PMF  for  a  doubly-stochastic  compound  Poisson  count  is  readily 
derived  via  conditioning  [9],  explicit  forms  are  not  common.  Due  to  the  particular  nature  of  the  MAI 
distribution,  we  are  able  to  show  that  the  PMF  may  be  expressed  as  a  straightforward  summation.  This 
expression  is  derived  for  independent  and  identically-distributed  (iid)  interferers  having  a  distribution 
that  includes  the  cases  of  user  asynchronism  and  chip  synchronism.  It  will  be  seen  that  the  error  rate 
expression  is  simplified  when  the  distribution  of  the  MAI  is  discrete.  In  Section  4  we  take  advantage  of 
this  fact  to  derive  arbitrarily-tight  bounds  on  the  error  rate  which  are  considerably  easier  to  compute 
than  the  exact  error  rate.  In  Section  5  we  use  the  same  bounding  technique  to  derive  arbitrarily-tight 
bounds  on  the  error  rate  when  the  interferers’  energies  are  not  identical.  In  Section  6  we  focus  on 
the  special  case  of  prime  codes,  equal  energies,  and  unity-gain  photodetectors  in  order  to  compare  the 
optimal  threshold  and  minimum  error  rate  to  those  obtained  using  the  approximations  discussed  above. 
The  approximation  of  perfect  optical-to-electrical  conversion  yields  poor  estimates  of  ‘he  error  rate  and 
optimal  threshold  at  moderate  incident  optical  intensities  and  dark  currents.  Further,  the  combined 
approximations  of  perfect  optical-to-electrical  conversion  and  Gaussian-distributed  MAI  together  yield 
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an  underestimate  of  the  optimal  threshold  and  an  error  rate  that  is  neither  an  upper  nor  a  lower  bound. 
We  also  show  that  when  prime  sequences  are  employed,  the  chip-synchronous  approximation  leads  to 
an  overestimate  of  the  error  rate.  The  validity  of  these  approximations  for  larger  optical  powers  is  also 
discussed  in  Section  6. 


2 .  Optical  CDMA  Model 

During  each  length-T  symbol  interval,  the  ith  transmitting  laser  is  amplitude-modulated  by  the 
product  of  the  data,  which  takes  on  values  in  {0, 1},  and  an  assigned  signature  sequence.  In  this  work,  a 
signature  sequence  is  a  deterministic,  {0,  l}-valued,  piecewise-constant  function  on  (0,  T],  and  is  specified 
by  the  values  that  it  takes  on  the  N  equal-length  subintervals  (chips)  of  [0,  T].  We  define  Pj  —  P  as  the 
number  of  non-zero  chips  in  the  jth  signature  sequence,  bjn  as  the  transmitted  symbol  of  the  jth  user 
in  the  interval  (nT,  (n  +  1  )T\,  and  cj(t)  as  a  periodic  replication  of  the  jth  signature  sequence.  The 
transmitted  complex  scalar  field  from  the  jth  laser  may  be  expressed  as 


where  s  is  proportional  to  the  optical  energy  per  bit  of  the  transmitting  laser,  vj  denotes  the  optical 
carrier  frequency  of  the  jtfl  user,  and  9j  is  the  phase  offset  of  the  jth  laser  from  the  first  laser.  Thus 
we  have  assumed  that  all  users  transmit  with  identical  signal  energies.  This  is  not  the  case  in  general, 
and  in  Section  5  we  consider  the  more  general  case  of  unequal  signal  energies.  The  laser  phase  noise  is 
represented  by  ay  Wy(<),  where  ay  is  related  to  the  jtfl  transmitting  laser  linewidth,  Bj,  by  ay  = 

The  relative  delays  {ry}|l2  are  defined  on  (0,  T)  with  reference  to  the  receiver  of  the  first  user.  As  there 
is  no  cooperation  between  the  users,  it  is  appropriate  to  model  the  relative  delays  as  iid  random  variables 
that  are  uniformly  distributed  on  the  interval  [0,  T).  We  shall  assume  that  the  symbol  rate  of  each  user  is 
the  same,  the  optical  fields  of  the  K  users  add  in  a  noncoherent  fashion,  and  that  each  single-user  receiver 
acquires  the  timing  of  its  transmitter’s  symbol  epochs.  With  ideal  transmission,  (1)  also  represents  the 
complex  scalar  field  at  the  first  receiver  due  to  user  j. 

During  the  time  interval  [0,  T),  the  intensity  of  the  total  optical  field  at  the  receiver  of  the  first  user 
is 

AT  K 

|r(t)|2  =  -  Tj)pt(0,Tj)  +  bj0Cj(t  -  Tj)pt(Tj,T),  (2) 

j- 1 

where  pt{a,b)  is  a  rectangular  pulse  of  unit  height  with  support  [a,t>).  We  are  interested  in  one-shot 
detection  of  the  data  6io  based  on  an  observation  of  a  photon  count  in  [0,T).  The  underlying  photon 
point  process  is  driven  by  a  filtered  version  of  |r(t)|2,  which  depends  on  the  data  bio  only  at  times 
{<|cj (t)  =  1,  0  <  t  <  T}.  This  follows  from  (2)  and  the  additional  fact  that  rj  =  0.  By  correlating 
the  received  point  process  in  (0,T)  with  ci(t),  one  may  obtain  the  photon  count  during  the  support 
of  ci  and  may  then  decide  on  the  data  6jo  based  on  this  count.  This  suboptimal  processing  scheme 
is  the  basis  for  the  matched-filter  CDMA  receiver.  Since  the  function  c\(t)  takes  values  on  {0,1},  the 
correlation  is  easily  achieved  at  low  chip  rates  by  an  electro-optic  modulator,  which  allows  light  to  pass 
only  when  c\{t)  =  1.  A  fiber  optic  tap  delay  line  may  be  used  to  achieve  the  matched-filter  operation  at 
higher  chip  rates.  This  all-optical  device  uses  the  finite  propagation  velocity  of  light  to  achieve  a  relative 
delay  between  two  optical  signals  by  passing  them  through  fibers  of  different  lengths.  The  matched-filter 
CDMA  receiver  has  been  studied  in  several  experiments  [2,3]  and  will  be  the  CDMA  receiver  analyzed 
in  this  work. 

As  seen  in  (1),  this  CDMA  system  employs  a  form  of  on-off  modulation  in  which  no  light  is  trans¬ 
mitted  for  a  "0”,  and  the  signature  sequence  is  transmitted  for  a  “1”.  This  receiver  has  been  analyzed 
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previously  using  the  point  process  model  [10].  An  optical  CDMA  system  employing  a  different  modu¬ 
lation  scheme  has  also  been  analyzed  using  the  point  process  model  [11].  In  that  system  each  user  is 
assigned  two  signature  sequences,  one  for  each  symbol  value.  The  receiver  matched-filters  the  amplified 
photocurrent  by  the  difference  of  the  signature  sequences  of  the  desired  user,  and  compares  the  output 
to  a  threshold. 


Figure  1.  The  optical,  noncoherent  matched-filter  CDMA  receiver 


The  matched-filter  CDMA  receiver  to  be  analyzed  in  this  paper  is  shown  in  Figure  1.  The 
total  received  optical  signal  r(t)  is  coupled  to  a  1  xP  beam  splitter.  Each  of  the  outputs  of  the  splitter  are 
identical  copies  of  the  input  signal  that  are  attenuated  in  intensity  by  P.  The  ith  tap  delays  the  received 
field  so  that  the  optical  signal  during  the  ith  non-zero  chip  of  the  first  signature  sequence  overlaps  in  time 
with  the  last  non-zero  chip  in  the  undelayed  signal.  The  delayed  signals  are  noncoherently  recombined, 
and  the  aggregate  signal  is  incident  on  a  photomultiplier  screen.  The  photomultiplier  emits  a  random 
number  of  secondary  (output)  electrons  for  each  detected  photon  or  thermoelectron,  and  the  matched- 
filter  receiver  compares  the  secondary  electron  count  during  the  last  non-zero  chip  interval  of  the  first 
signature  sequence  to  a  threshold  in  order  to  decide  on  the  value  of  6xo-  For  the  remainder  of  this  work, 
we  denote  this  secondary  electron  count  by  Af. 

We  shall  employ  a  common  photomultiplier  model,  in  which  the  intensity  of  primary  electrons 
is  given  by  a|r(t)|2  +  /3,  where  a  is  proportional  to  the  quantum  efficiency  of  the  photodetector,  and 
/?  denotes  tue  rate  of  primary  electrons  due  to  an  independent  dark  current.  The  nth  primary  electron 
yields  a  random  number,  gn,  of  secondary  (output)  electrons,  and  the  collection  {<7n}  is  modeled  as  an 
iid  sequence,  which  is  independent  of  the  primary  electron  point  process  [12].  The  common  probability 
generating  function  of  the  random  gains  is  denoted  as  G(z)  =  J^T=0  zk/R[dn  =  &].  It  follows  that  Af  is 
a  conditionally  compound-Poisson  random  variable  given  the  integrated  intensity,  which  we  define  as  A, 

A  =  —  /  c1(f)[a|r(<)|2  +  0\  dt  =  as  610  +  d  +  ^  ^  bj-iRji(rj)  +  bj0Rji(rj),  (3) 

“  °  jsz  2 

and  the  conditional  distribution  of  Af  depends  only  on  G(z)  and  A.  Here  Rji(r)  and  Rji(r)  are  the 
normalized,  partial  cross- correlations 

RMt)  =  J!  JQ  cj(t  -  T)d(t)dt 
Rjl(r)  =  j:  Jr  Cj(t  -  r)a(t)dt, 

that  represent  the  contributions  to  A  by  user  j  for  the  duration  of  bj-\  and  bjo,  respectively.  In  (3) 
d  represents  the  portion  of  the  primary  electron  count  mean  due  to  thermoelectrons.  Without  loss  of 


5 


Figure  2.  Probability  density  function  of  -type  random  variable. 


generality  we  set  the  quantum  efficiency  of  the  photodetector  to  unity,  as  this  affects  the  distribution 
of  Af  as  an  attenuation  in  intensity.  For  the  same  reason,  we  neglect  the  combining  loss  in  the  coupler. 

Further,  we  define  *  =  s6io  =  0  under  hypothesis  Ho  and  z  —  s  under  hypothesis  Hi 

In  summary,  Af  is  conditionally  compound-Poisson-distributed  with  conditional  primary  elec¬ 
tron  mean  A.  This  conditioned  mean  has  a  distribution  which  is  shifted  to  the  right  under  Hi,  and  the 
receiver  decides  for  Hi  if  Af  exceeds  some  threshold.  In  general,  a  single-threshold  test  is  not  optimum, 
although  sufficient  conditions  for  the  optimality  of  single-threshold  detection  have  been  determined  [13- 
15].  An  important  outcome  of  this  work  is  to  determine  the  threshold  which  minimizes  the  error  rate  of 
the  receiver.  In  other  words,  we  shall  find  the  minimum  error  rate  detector  among  the  class  of  detectors 
that  compares  the  count  Af  to  a  threshold.  Since  AT  is  integer-valued,  it  is  straightforward  to  find  this 
threshold  given  the  probability  mass  functions  (PMFs)  of  Af  under  each  hypothesis.  We  find  the  PMFs 
of  A f  in  the  next  section. 


3.  Derivation  of  V[Af  —  n  \  x\ 


In  this  section  we  derive  the  PMF  of  the  secondary  electron  count  at  the  integrator  output  for 
an  arbitrary  photomultiplier  and  for  chip-synchronous  or  completely  asynchronous  transmission.  This 
expresJon  will  be  used  in  Section  4  to  develop  arbitrarily-tight,  computationally-efficient  bounds  on  the 
cumulative  distribution  function  of  Af,  and  in  Section  6  we  will  use  the  probability  mass  functions  of 
Af  in  order  to  compare  the  exact  error  rates  and  optimal  thresholds  to  those  obtained  using  popular 
approximations. 

In  order  to  concisely  describe  the  statistics  of  A,  we  define  a  7-type  random  variable  to  be  a 
mixed  random  variable  having  point  masses  at  the  integers  {0, 1, . . .  M }  for  some  positive  integer  M,  and 
constant,  continuous  portions  between  these  integers.  For  fixed  M,  the  class  of  7-type  random  variables 
is  parameterized  by  2 M  parameters,  each  taking  values  on  [0, 1].  If  /  is  a  7-type  random  variable  with 
2 M  parameters,  we  define 

d(i)  =  V[I  =  i]  i  e  {0,1,...,M} 

c(j)  dv  =  V [I  €  [v,  v  +  dv)]  [v,v  +  dv)  C  (j,j  +  1)  j  G  {0,1,..., M  -  1}, 

and  we  denote  the  distribution  of  I  as  {d(0),  d{\), d(M),  c(0), . . . ,  c(M  -  1)}.  Figure  2  illustrates 
the  density  of  a  7-type  random  variable. 
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Each  normalized,  partial  cross-correlation  is  a  7-type  random-variable  with  2 M  parameters, 
where  M  is  an  upper  bound  on  the  set  of  total  cross-correlations  {Rjk  +  Rjk}.  This  follows  in  part 
from  the  fact  that  each  partial  cross-correlation  is  piecewise-linear  in  the  relative  delay,  since  it  is  a 
convolution  between  piecewise-constant  functions. f  In  fact,  Rjk(r)  attains  integer  values  at  the  chip- 
synchronous  delays  r  €  {77 11,  n  =  0, . . .  N  —  1},  and  is  piecewise-linear  between  these  values.  If  the 
common  distribution  of  the  relative  delays  is  discrete  on  {^T,  n  =  0, . .  .  jV  -  1},  then  it  is  clear  that 
Rjk{T)  >s  7"kype  with  c(q )  =  0  Vq.  It  follows  that  under  the  assumption  of  chip  synchronism,  the 
distribution  of  the  partial  cross-correlations  is  7-type.  In  the’  asynchronous  case  the  relative  delays  are 
uniformly  distributed  on  [0,T],  which  combined  with  the  piecewise  linearity  of  Rjk(r)  also  yields  a 
7-type  distribution.  In  this  case,  point  masses  arise  due  to  constant  portions  of  the  cross-correlations. 
Further,  c(q)  is  strictly  positive  if  and  only  if  the  normalized,  partial  cross-correlation  takes  on  both  a 
value  strictly  greater  than  q  and  a  value  less  than  or  equal  to  q  on  the  set  of  chip-synchronous  delays. 
From  the  piecewise-linearity  of  Rjk ,  it  is  also  clear  that  the  distribution  of  Rjk  for  user  asynchronism 
may  be  computed  from  a  knowledge  of  the  function  Rjk  evaluated  only  at  the  chip  synchronous  delays. 
The  M  =  1,  7— type  distribution  has  been  used  in  the  analysis  of  optical  CDMA  systems  employing  a 
subclass  of  OOCs  [7].  This  analysis,  however,  did  not  employ  a  point  process  model. 

In  this  section  we  will  derive  the  probability  generating  function  (PGF)  of  Af  from  its  condi¬ 
tional  compound-Poisson  nature,  and  show  that  this  z-transform  has  a  particularly  straightforward  and 
explicit  Maclaurin  series  expansion.  The  PMF  is  the  collection  of  coefficients  of  this  series,  and  may  be 

explicitly  represented;  as  follows.  By  conditioning  on  {z,(i?2i,  ^21) . (■^/fi)-RA'l)}>  the  count  Af  has 

a  compound  Poisson  distribution,  whose  PGF  is  given  by  (e.g.,  [16]) 


E 


=  e(*+4)(G(z)-l) 


K 

x  JJ  e^r^'"lR'»1+^0^,}(Gr(*)-1)>  (4) 

j= 2 


Due  to  the  mutual  independence  of  the  pairs  {(i?^ ,  Rji)}f=2,  the  closed  form  of  the  PGF  E  [zjV  j  *]  may 
be  found  by  smoothing  each  factor  in  (4)  individually.  The  jth  factor  depends  on  the  quantity  + 

bjO&jU  which  we  describe  as  the  jth  interference  mixture.  It  is  clear  that  the  jth  interference  mixture  is 
also  of  7-type  with  2  M  parameters,  and  we  denote  its  distribution  as  {Dj(  0),  Dj(l), . . . ,  Dj(M),  Cj{  0), . . 
1)}.  Since  the  mixture  coefficients  take  values  in  the  set  {0, 1},  the  distribution  of  the  inter¬ 

ference  mixtures  is  computed  easily  from  the  marginals  of  the  partial  and  total  cross-correlations.  With 
this  notation,  the  closed-form  expression  of  the  power  series  of  interest  is 


r  , ,  .  K  ,  M 

K  |  *]  =  «(°W-1)(*W)  x  II  E  DM  exp (,i(GW  -  1)) 

j=2  %=0  r 

-  7  T-  G(.-)  E  Oi(>-)exp(r-(0(2)  -  l))j. 


E 


(5) 


We  are  interested  in  finding  V[Af  —  n  |  *],  the  coefficient  of  zn  in  the  power  series  of  (5)  about  z  =  0. 
This  power  series  is  straightforward  but  unnecessarily  general  for  most  signature  sequence  sets  of  interest. 
For  example,  the  number  of  parameters  in  the  power  series  is  reduced  by  a  factor  of  K  -  1  by  assuming 
that  the  distribution  of  the  jth  interference  mixture  is  independent  of  j  -  that  is,  the  contribution  of 
user  j  to  the  MAI  is  statistically  indistinguishable  from  the  other  interferers.  We  have  verified  that  this 
is  a  reasonable  approximation  when  the  signature  sequences  come  from  the  prime  codes,  and  will  drop 
the  subscript  from  the  distribution  of  the  interference  mixtures  for  the  sake  of  clarity.  Also,  the  power 

f  We  emphasize  that  in  this  work  the  signature  sequences  are  deterministic  and  arbitrary.  The 
randomness  in  the  cross-correlations  is  due  solely  to  the  relative  delays. 
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series  of  (5)  is  concisely  written  if  we  define  C(-l)  =  C(M)  =  0.  With  these  simplifications,  (5) 
becomes 


E  [**  I  x]  =  e(G(*)-i)(*+d)x  j£p(g)eXp(  •  (<*(,)  _  !)) 

P  i  M  ^  K— 1 

-  -1)-  cM*qi  (GW_1)  •  w 

3  1  ~  J=o  J 


There  are  2M  terms  inside  of  the  braces.  Letting  nq  be  the  power  of  D(q),  and  mq  be  the  power  of 
[C(q  -  1)  —  C(q)]  in  a  term  of  a  multinomial  expansion  of  (6),  we  may  express  the  PGF  of  Af  as 


E  K  H  =  £  n'M  ~ ^ ;  {  n  W"  [j{c(,)  -Oh  -  i)}l 

1  1  1  ll,=On?!nV  I  9=0  L3  J 


exp[(C?(z)  -  1)  {s  +  d  -f  -p  +  mi]}] 

(i  —  G(z)y^i=om' 


(V 


where  the  outer  summation  is  over  all  the  indices  such  that  mg  +  ni  =  K  —  1.  We  find  the  PMF 
of  A f  in  the  following  way.  Suppose  that  we  knew  the  coefficients  of  the  following  power  series 


n= 0 


eo(G(*)-l) 

(i  -o(z)Y  * 


a  e  IR+,  8  e  {0,1,2...}. 


(8) 


Recognizing  the  similarity  of  the  last  line  of  (7)  with  the  right  hand  side  of  (8),  we  could  express  the 
PMF  for  AS  in -terms  of  these  coefficients  as 


VW=n\  4=DJ--|)!-.  n  DW'  [“Wf)  -  c(?  -  1)>T 

n9=o  U  J 

/  M  M  \ 

W  +  m/]},]Trn/  , 

\  r  1=0  1=0  J 


(9) 


where  the  outer  summation  is  over  all  mq  and  nq  such  that  the  mq  +  nq  =  K  -  1.  In  fact, 
W(n,  or,  6)  may  be  calculated  by  a  linear  recursion  on  the  integers  n  and  6.  This  recursion  for  W  is 
most  easily  seen  by  re-expressing  the  following  identity  using  (8) 


e«(G(z)-l) 

(1  -  G(z))6+l 


e«(G(z)-l)  _  ca(G(*)-l) 

G(z)  (1  -  G(z))s+'  +  (1  -  G(z))6' 


5  €  {0,1,2,...}. 


We  recall  that  G(z)  is  the  PGF  of  the  photomultiplier  gain  distribution,  G(z)  =  ^}1q  zl7P[g  =  ij.  It 
follows  from  this  substitution  that  for  all  n,  6  g  {0, 1,2,.. .}  the  linear  recursion  for  W  is 


n+l 

( 1  -V[g  =  0])W(n  +  l,a,<5+l)  =  £Pk  =  /]  W(n  +  1-1,  oc, 6  +  1)  +  W(n  +  1,  a,  6)  (10) 

1=1 


For  most  photomultiplier  models  V[g  =  0]  =  0,  which  we  will  assume  in  the  sequel.  The  initial  conditions 
of  this  recursion  are  also  easily  extracted  from  the  definition  of  W, 

W(0,<*,<$)  =  e-a  ,  *€  {0,1,2,...}  (11) 
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and 


n  k 
<*  -c 


r  k 


W(n,a,0)  =  X;  — e"“  V 


k=0 


E^l  =  « 

U—l 


n  €  {0,1,...}. 


The  linear  recursion  for  W(j,a,6)  on  n  and  $  permits  fast,  efficient  computation  for  any  arguments 
n,  5  >  1.  Note  that  the  second  initial  condition  for  this  recursion  depends  on  the  probabilities  V  91  = 
for  n,  k  €  {0, 1,2,.. .}.  These  probabilities  require  iterated  convolutions  of  the  PMF  of  the  random  gain 
gi ,  may  be  precomputed  and  stored  for  small  n  and  k,  and  may  be  approximated  accurately  for  large 
n  and  k.  It  is  easy  to  show  that  V  gi  —  nj  has  an  explicit  form  for  random  gains  that  are 
shifted-Poisson-distributed  in  addition  to  the  unity-gain  case. 


It  is  helpful  at  this  point  to  look  at  (9)  in  the  special  case  of  one  interferer  and  unity-gain 
photodetection.  In  this  case  (9)  becomes 


M 

VW  =  n  I  *]  =  £  D(j)V  [n(d+  X  +  j. i)  =  n] 
i= o 

M- 1 

+  E  <?(}){?  [n(cf+*  +  j-£)<n]  -^[n(d+r  +  (i  +  l)~)<n]} 
i=o  r 


where  II(m)  is  a  Poisson  random  variable  with  mean  m.  From  this  equation,  we  see  that  if  the  interference 
takes  on  a  value  corresponding  to  a  point  mass,  then  the  contribution  to  the  count  PMF  is  a  Poisson 
PMF,  as  expected.  If  the  interference  takes  on  a  value  between  the  point  masses,  then  the  contribution 
to  the  PMF  of  Af  is  the  difference  between  two  Poisson  cumulative  distribution  functions  (CDFs). 
This  latter  fact  is  due  to  the  piecewise  constant  nature  of  the  continuous  portion  of  the  interference 
distribution. 

We  have  shown  that  the  PMF  of  A f  may  be  expressed  as  a  finite  summation  of  terms  involving 
the  function  W.  We  have  also  demonstrated  that  W  may  be  computed  by  a  linear  recursion  on  its  integer 
arguments.  These  expressions  are  valid  when  the  distribution  of  the  normalized,  partial  cross-correlations 
is  7-type  (Figure  2)  such  as  in  the  cases  of  user  asynchronism  and  chip-synchronism.  It  should  be  pointed 
out  that  the  number  of  terms  in  the  PMF  summation  (9)  is  exponential  in  the  number  of  interferers,  and 
may  be  prohibitively  large  for  large  K.  In  the  next  section  we  show  that  by  modifying  the  distribution 
of  the  interference  mixtures,  we  may  use  a  special  case  of  (9)  to  express  arbitrarily  tight  bounds  on  the 
PMF  as  a  summation  that  is  bilinear  in  the  number  of  interferers  K  —  1  and  the  parameter  M. 


4.  Arbitrarily  Tight  Bounds  onV[Af  <  n  |  x] 

Cornputationally-efficient  bounds  must  reduce  the  complexity  of  (9)  in  both  the  multinomial 
summation  and  the  computation  of  W,  while  controlling  the  loss  of  accuracy  by  a  parameter  of  our 
selection.  In  this  section  we  show  that  by  quantizing  the  interference  mixtures,  we  achieve  all  three 
objectives.  The  intuition  behind  the  bounds  is  demonstrated  in  the  following  special  case.  Suppose  that 
each  interference  mixture  has  a  discrete  distribution  on  the  set  {0,  <j,  jy,  •  ■  ■ ,  M}  containing  QM  +  1 
elements.  In:  this  case  the  conditional  mean  A  takes  on  ( K  -  1  )QM  +  1  possible  values.  When  A 
has  a.  discrete  distribution,  C(j)  =  0  Vj,  and  the  conditional  CDF  of  Af  given  x  and  all  partial  cross¬ 
correlations  is  given  by  (9)  as 

n 

v[Af<n\x,(R21)R21)...  (RKl ,  RKl)\  =  £  W(i,  A,  0).  (12) 

<=o 
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Since  the  right-hand  side  depends  only  on  an  initial  condition  for  W,  the  assumption  of  a  discrete 
distribution  on  A  eliminates  the  need  to  compute  the  linear  recursion  for  W.  Also,  computation  is 
reduced  substantially  by  noting  that  the  conditional  CDF  of  Af  depends  on  {R 

the  conditional  mean  A  =  sc  +  d+p  £  f=2  &j-l  l  +  bjo&ji  •  This  is  not  true  for  the  exact  interference 
distribution,  in  which  case  the  conditional  CDF  is  also  dependent  on  6  =  ]T)jL2  mj>  the  total  number  of 
interference  mixtures  that  take  on  non-integer  values.  The  effect  of  6  on  the  exact  PMF  may  be  seen  in 
(9).  This  observation  allows  us  to  uncondition  (12)  over  the  distribution  of  the  ( K  —  1  )QM  +  1-valued 
conditional  primary  electron  mean  A,  rather  than  the  joint  distribution  of  the  interference  mixtures, 
which  would  require  (2 M  +  l)^”1  terms.  Quite  simply,  a  discrete  distribution  for  the  interference 
mixtures  yields  a  CDF  for  A/*  that  is  linear  rather  than  exponential  in  the  number  of  interferers. 

But  how  do  we  obtain  arbitrarily-tight  bounds  on  the  exact  conditional  error  rate  V  [Af  <  n  |  k] 
that  use  a  discrete  interference  mixture  distribution?  Suppose  we  quantize  each  term  of  the  interference 
mixture  bj-iRji  +  bjoRji  with  a  step  size  of  q,  Q  6  -{1,2, . . .},  and  round-up  or  round-down  to  form 
bounds  on  the  interference  mixtures.  That  is,  we  form  A/,  Au  from  (3) 

K  .  . 

AJ  =  *  +  d  +  J  Xj  bj-i  -  IQRji]  +  bjo—  LQ-RjiJ 

and 

A  u  =  x  +  d+~Y^  bj +  bj0^QRj{] , 

where  [fij  ( f" /?*] )-  is  the  greatest  (least)  integer  function  of  R.  Although  it  is  obvious  that  A;  <  A  <  Au, 
it  is  not  clear  that  we  may  form  bounds  on  the  secondary  electron  count  CDF  by  substituting  A /  and 
Au  for  A.  A  subtle  point  is  raised  by  considering  the  form  of  A f 


n(A) 

^(A)  =Yj9p 


p=  i 


(13) 


where  11(A)  is  a  conditionally-Poisson  count  with  conditional  mean  A.  Since  the  random  gains  gp  are 
non-negative,  M  increases  with  the  primary  electron  count,  II.  It  is  easy  to  show  that  A;  <  A  implies 
■P(II(A)  <  n]  <  ‘P[H(A|)  <  n]  for  all  n  [17],  yet  it  is  not  clear  from  (13)  that  corresponding  bounds 
on  the  CDF  of  M  follow  from  this  fact.  In  the  lemma  below  we  show  that  we  may  achieve  bounds  on 
V[M<  n  |  x]  by  using  the  distributions  of  A /  and  Au.  This  is  shown  by  first  considering  the  case  when 
A  is  deterministic. 

Lemma  1,  Let  11(a)  be  a  Poisson  random  variable  with  mean  a,  and  let  A 7(a)  =  9h> 

where  { g is  a  collection  of  nonnegative  i.i.d.  random  variables  that  is  independent  of  11(a).  Let 
0  <  a'  <  a.  Then 


V[A f{a)  <  n]  <  V  [A /’(o')  <n]  ,  n  >  0. 
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Proof. 


V[M{a)  <n)  =  f^V[H(a)  =  k]V[9i  +  ...  +  gk  <n] 
k~  0 

=  f  ){v  [n(a)  <k]-v  [n(a)  <  k  -  1]}  V  [9l  +  . . .  +  9k  <  n] 

k=0 

=  f>P(«)<*]  {P\gi  +  ...  +  gk<n]-V[gi  +  ...  +  gk+i<n]} 
k= 0 
co 

<  X)  ^  [n(a0  <  fc]  [fll  +  •  •  ■  +  ffjfc  <  nJ  -  ^[31  +  •  •  •  +  9k+ 1  <  »]} 
k=0 

=  P  [jV(a')  <  n] 

The  inequality  follows  from  the  fact  that  the  Poisson  CDF  is  a  decreasing  function  of  the  mean.  This 
lemma  extends  easily  to  the  case  when  11(A)  is  conditionally  Poisson  with  conditional  mean  A.  By 
conditioning  on  A,  we  have 

V[Af{A)  <  n  |  A]  <  V  [A/*(Ai)  <  n  |  A] 

from  Lemma  1  since  both  A  and  A;  are  known  under  this  conditioning.  The  desired  result  follows  by 
smoothing  with  respect  to  the  distribution  of  A.  Lemma  1  allows  us  to  compute  an  upper  bound  of  the 
CDF  of  Af  under  each  hypothesis  by  averaging  the  right  hand  side  of 

V (Af(A)  <  n  |  x,  A,  A(]  <  V  [Af(A,)  <  n  |  x,  A,  A,] 

over  the  ( K  -  l)QM  +  1-valued  distribution  of  A;.  The  tightness  and  complexity  of  the  upper  (and 
lower)  bounds  are  controlled  by  the  quantization  step  size  It  is  obvious  by  comparing  the  number  of 
terms  per  point  of  the  CDF  to  those  of  the  bounds  that  even  for  moderate  numbers  of  transmitters  and 
fine  quantization,-  the  bounds  require  significantly  less  computation. 


5.  Arbitrarily  “Tight  Error  Rate  Bounds  for  Unequal  Energies 

Using  Lemma  1,  we  may  form  bounds  on  the  count  CDF  under  each  hypothesis  by  developing 
bounds  on  the  underlying  conditional  mean.  In  the  case  of  equal  energies,  we  may  achieve  this  by 
uniformly  quantizing  each  interference  mixture,  and  as  the  quantization  step  size  decreases,  the  accuracy 
of  the  bounds  improves.  In  this  section  we  show  that  the  same  technique  may  be  applied  to  the  case  of 
unequal  energies. 

We  may  express  the  conditional  mean  as 

1  K 

A  =  x  +  d+  —  ^  sj[bj-iRji  +  bjoRji], 
i= 2 

where  3j  is  the  energy  per  “1”  for  user  j  and  x  =  si6io-  In  this  case  we  define  A/,  Au  as 

a  {  =  x  +  d+J2  bi-^sjRn\  + 


n 


K 


Au  =  x  +  d  +  Y^  bj-i^lQ^Rj{\  +  bj0^\Q^Rj{\. 


Note  that  the  quantity  ^  appears  inside  each  quantizer.  Unlike  the  equal-energy  case,  we  cannot  form 
A /  and  Au  by  quantizing  the  partial  cross-correlations  directly.  Instead,  each  must  be  scaled  by  the 
appropriate  energy  3j.  However,  it  is  clear  that  Aj  <  A  <  AU)  and  we  may  form  bounds  on  the  count 

CDF  using  quantization  and  Lemma  1.  The  quantized  mean  may  take  on  at  most  (MQmax  ^  +  l)^-1  — 

(K  -  1)  values,  where  M  is  the  maximum  for  the  common  interference  mixture.  The  distributions  of  A/ 
and  Au  are  obviously  discrete,  and  may  be  computed  exactly.  More  importantly,  we  may  uncondition 
(12)  by  the  distribution  of  Au  to  obtain  a  lower  bound  to  the  count  CDF  under  each  hypothesis.  The 
bounds  on  the  CDF  of  A/"  are  most  easily  seen  from  (9)  with  the  following  abuse  of  notation.  Let 

M  =  Q  max(Au  —  (x  +  d )),  and  let  D(j)  =  V  |au  =  x  +  d  +  denote  the  distribution  of  A'u  in  (9). 
Since  max(Au  -  (x  +  d))  is  a  multiple  of  l/Q  due  to  quantization,  M  is  an  integer.  The  idea  is  to 
consider  Au  as  the  quantized  interference  due  to  one  user  having  an  interference  mixture  distribution 
{£>(0) , . . .  D(M)} .  To  complete  this  analogy,  we  must  set  K  —  1  =  1  in  (9).  The  corresponding  CDF 
bound  is 


n  M 


■P  W  <  n  |  x]  >  £ E  °(i)W(v *  +  d  +  £,l °) 
i=0  j=0  V 


Note  that  we  have  set  a  =  1  in  (9),  since  the  interference  energies  are  incorporated  in  the  quantization. 
An  upper  bound  to  V  [Af  <  n  |  x]  follows  from  A /  in  the  same  way.  It  is  helpful  to  note  that  for  unity-gain 
photodetection  the  above  bound  becomes 


M 

V[M  <n\x]>YJD{j)V 
j=o 

where  II(m)  is  a  Poisson  random  variable  with  mean  m.  This  follows  from  the  conditional  Poisson 
nature  of  A f,  when  driven  by  a  conditional  mean  with  a  discrete  distribution. 


n(x  +  d  + 


6.  Example:  Equal  Energies,  Prime  Sequences  and  PIN  Photodiodes 

In  order  to  compare  the  exact  error  rate  to  the  approximations  discussed  earlier,  we  must 
first  compute  the  7-type  distribution  (D(0), .. . ,  D(M),  C(  0), .. .  ,C(M  -  1)),  which  is  used  in  (9).  This 
distribution  may  be  computed  once  the  signature  sequence  set  and  the  distribution  of  the  relative  delays 
are  specified.  In  this  section  we  focus  on  the  set  of  prime  sequences  [18]  in  the  user-asynchronous  and 
chip-synchronous  cases.  We  shall  also  assume  equal  energies  for  all  users. 

Since  the  normalized  cross-correlations  of  prime  sequences  are  bounded  above  by  M  =  2, 
we  must  compute  (D(Q),  D(l),  D(2),  C(0),  C(l)}  for  the  chip-synchronous  and  asynchronous  cases  [18]. 
For  the  prime  sequences  frcrn  GF(31),  we  have  found  that  the  average  distributions  for  the  interference 
mixtures  are  given  to  two  significant“digits  as 


D(0)  D(l)  D(2)  C(0)  C(l) 
chip-synchronous  users  &  .57  .36  .07  .00  .00 
asynchronous  users  =t>  .44  ,22  -01  .24  .09 

Table  1.  Average  distributions  of  the  interference  mixtures. 
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We  have  verified  that  the  MAI  for  prime  sequences  is  well-modeled  by  a  sum  of  iid  random 
variables.  In  particular,  the  mean,  variance,  and  third  central  moment  using  the  average  distribution 
for  each  interferer  were  identical  to  the  exact  MAI  moments,  while  the  fourth  central  moments  differed 
by  less  than  .004%  for  29  interferes.  Further,  the  average  distribution  of  the  GF(31)  prime  sequences 
did  not  differ  significantly  from  those  of  the  GF(ll)  and  GF(17)  prime  sequences.  For  this  reason  we 
shall  use  the  distribution  in  Table  1  for  all  calculations.  It  should  be  pointed  out  that  bounds  may  be 
achieved  by  considering  best  and  worst  case  (common)  interference  distributions,  as  was  done  in  [8j. 


Figure  3.  Comparison  of  the  Minimum  Error.Rates  For  Asynchronous 
Users  and  the  Chip-Synchronous  Approximation 


In  Figure  3  we  have  plotted  the  minimum  error  probability  of  the  matched-filter  CDMA 
receiver  for  the  chip-synchronous  approximation  and  for  completely  asynchronous  transmission.  We 
have  used  the  weight  17  and  length  289  prime  sequences  from  GF(17),  an  optical  energy  per  user  of 
s=1000  photons  per  bit,  and  a  dark  current  mean  of  d=50  thermoelectrons  per  bit.  For  a  single-user 
transmission  rate  of  V,  Gigabits  per  second,  these  numbers  correspond  to  a  peak  received  single-user 
power  of  H  pW  and  a  photodetector  dark  current  of  approximately  10  •  7Z  nA.  From  Figure  3  we  see 
that  the  chip-synchronous  approximation  upper-bounds  the  error  rate  in  the  asynchronous  case  by  at 
least  one  order  of  magnitude.  The  error  rates  arc  ordered  in  this  way  due  exclusively  to  the  differences 
of  the  distributions  of  the  interference  mixtures.  From  Table  1,  it  may  be  shown  that  the  means  of  the 
interference  mixtures  are  identical  in  both  cases,  while  the  ordering  of  the  variances  coincides  with  that 
of  the  error  rates.  Thus  the  MAI  has  identical  means  under  these  distributions,  and  second  moments 
whose  ordering  coincides  with  that  of  the  error  rates.  In  the  unity-gain  case  it  is  easy  to  show  that 
EfA/- j  x]  =  A,  and  Var(Jsf\x)  =  A  — (A)2  +  A2,  which  implies  that  under  each  hypothesis  the  mean 
of  A/"  is  unchanged  by  the  approximation  of  chip  synchronism,  yet  the  variance  of  M  increases  as  we 
proceed  from  complete  asynchronism  to  chip  synchronism.  From  the  ordering  of  the  minimum  error  rate 
curves  in  Figure  3,  we  see  that  an  increase  in  the  variance  of  A/-  under  each  hypothesis  results  in  an 
increased  error  rate  as  the  conditional  means  of  A f  are  fixed. 

In  [7]  it  was  shown  by  example  that  the  variance  of  the  interference  mixture  increases  under 
chip  synchronism  for  optical  orthogonal  codes  that  are  bounded  by  one  chip  of  interference.  We  will 
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now  show  that  an  interference  mixture  bj-iRji  +  bjoRji  has  a  larger  variance  under  chip  synchronism 
than  asynchronism  for  arbitrary,  deterministic  signature  sequences.  Due  to  the  independence  of  the 
users,  this  implies  that  the  MAI  has  a  larger  variance  under  chip  synchronism.  We  will  prove  this  in 
two  steps:  we  shall  show  first  that  each  cross-correlation  Rji,Rj 1(  and  Rji  +  Rji  has  a  larger  variance 
under  chip-synchronism,  and  then  we  shall  show  that  the  same  is  true  for  the  interference  mixture. 

Lemma  2.  The  variance  of  the  MAI  is  greater  under  chip  synchronism  than  complete 
asynchronism. 

Proof.  Let  R(r)  denote  any  of  the  above  cross-correlations.  Then  the  distribution  of  R(r) 
is  7-type,  and  R(t)  has  an  upper  bound  of,  say,  M.  R(t)  is  a  piecewise  linear,  continuous  function 
of  the  relative  delay  r  6  [0,  T\.  In  fact,  iJ(r)  is  linear  between  its  chip-synchronous  values.  Due  to  the 
uniform  distribution  on  the  delay,  and  the  piecewise  linearity  of  R(r),  it  is  easy  to  show  that  the  mean 
of  R(r)  is  the  same  under  chip  synchronism  or  asynchronism.  (The  same  is  true  for  the  interference 
mixture  by  similar  arguments.)  Thus,  it  is  sufficient  to  show  that  the  second  moment  of  R(r)  is  greater 
under  chip  synchronism  than  complete  asynchronism. 

The  distribution  of  R(r)  under  complete  asynchronism  has  the  form  (d(0), . . .  d(M),  c(0), . . . , 
c(M  —  1)),  where  c(j)  represents  a  piecewise  constant  portion  between  j  and  j  +  1.  Equivalently,  this 
distribution  may  be  described  by  (d(0), . . . ,  d(M),  {um*},  for  m,k  =  0  ,...M  and  rn  <  k),  where 
is  the  height  of  a  square  pulse  on  the  interval  (m,  k ).  These  square  pulses  overlap,  and  the  sum 

of  all  such  that  (j,j  +  1)  C  ( m,k )  is  equal  to  c(j).  The  value  umk  is  equal  to  the  fraction 

of  consecutive  values  of  {m,  k}  or  {&,m}  in  the  sequence  of  chip  synchronous  values  of  R(r).  We 
introduce  this  decomposition  for  the  following  two  reasons.  First,  the  second  moment  of  R(r)  is  linear 
in  each  of  the  um*.  Second,  the  probability  mass  on  the  interval  (m,  k )  represented  by  umk  under  user 
asynchronism  vanishes  to  the  endpoints  m  and  A;  under  chip  synchronism  in  such  a  way  as  the  center  of 
mass  is  conserved  on  the  interval  (m,  k).  The  latter  fact  is  the  reason  that  the  means  of  R(t)  coincide 
under  chip  synchronism  and  complete  asynchronism.  Because  of  these  two  facts,  it  is  sufficient  to  show 
that  under  the  constraint  of  a  constant  mean,  the  uniform  distribution  on  (m,  k)  has  a  smaller  second 
moment  than  the  discrete  distribution  on  {m,fc}.  This  condition  is  easy  to  show,  and  it  follows  that 
the  variance  of- any  cross-correlation  Rji ,  Rji,  or  Rji  +  Rji  between  deterministic  sequences  is  greater 
under  chip  synchronism  than  asynchronism. 

Now  we  would  like  to  show  that  the  interference  mixture  bj-iRji+bjoRji  has  a  higher  vari¬ 
ance  under  chip  synchronism.  Since  the  means  of  this  random  variable  coincide  under  chip  synchronism 
and  complete  asynchronism,  it  is  sufficient  to  show  that  the  second  moment  of  the  interference  mixture 
is  larger  under  chip  synchronism.  But  this  follows  from  the  same  fact  for  the  cross-correlations,  since 
the  distribution  of  the  interference  mixture  is  a  convex  combination  of  the  marginals  of  Rji, Rji,  and 

Rji  +  Rji . 

Since  the  variance  of  each  interference  mixture  increases  under  chip  synchronism,  the  same 
is  true  for  the  MAI.* 


Direct-detection  communication  systems  often  require  large  received  optical  energies  to  achieve 

•an  acceptable  error  rate.  Therefore,  we  are  interested  in  the  asymptotic  distribution  of  Af  as  s  grows 

without  bound.  In  the  simplest  case  when  A  is  deterministic  (A f  is  compound-Poisson)  it  is  well-known 

that  a  normalized  version  of  Af  converges  in  distribution  to  a  Gaussian  random  variable.  The  asymptotic 

distribution  has  also  been  established  for  random  A  and  unity-gain  photodetection  [19,20].  This  result 

requires  that  A  — ►  oo  and  cr ^  >  0,  and  shows  that  the  asymptotic  distribution  depends  on  the  limit 
^  2 

of  p  =  -£■.  If  limp  =  0,  then  the  normalized  count  converges  in  distribution  to  a  standard  Gaussian 
random  variable.  If  limp  =  oo,  then  the  normalized  count  converges  in  distribution  to  A  =  lim—^-. 
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Finally,  if  0  <  limp  <  oo,  then  the  normalized  count  converges  in  distribution  to  an  independent 
mixture  of  A  and  a  standard  Gaussian.  This  result  has  also  been  generalized  to  include  the  case  of 
photomultiplication  [21].  We  state  this  result  below. 

Lemma  3.  Let  13  be  a  conditionally  Poisson  random  variable  with  mean  A  when  conditioned 
on  A,  and  let  Af  =  £j?=0  9k>  where  { g are  iid  positive  random  variables  with  finite  mean  and  variance. 

Suppose  that  {g^}  are  independent  of  A,  <r\  >  0,  and  define  p  =  Then  if  A  — +  oo  the  normalized 

random  variable  Af  =  converges  in  distribution  as  follows.  If  limp  =  0,  then  Af,  converges  to 

a  standard  Gaussian  random  variable.  If  limp  =  oo,  then  Af,  then  converges  to  A.  Finally,  if  limp  is 
finite  and  non-zero,  then  Af  converges  to  a  mixture  of  the  two  limiting  cases  above.f 


Lemma  3  is  most  easily  shown  by  taking  the  limit  of  the  characteristic  function  of  Af.  In  the 
matched-filter  CDMA  receiver,  A  is  proportional  to  a,  and  A,  p  — *•  oo.  Therefore,  the  normalized  count 
converges  in  distribution  to  the  scaled  conditional  mean  A  as  a  — *■  oo.  This  asymptotic  result  is  more 
commonly  known  as  “perfect  optical-to-electrical  conversion”.  It  is  important  to  note  that  A  is  not  a 
Gaussian  random  variable,  as  it  is  composed  of  a  finite  number  of  bounded,  mixed  random  variables.  It 
follows  from  Lemma  3  that  the  normalized  count  will  not  converge  to  a  Gaussian  random  variable  as 
s  — *  oo,  in  contrast  to  the  case  when  A  is  deterministic.  In  fact,  it  will  be  shown  in  the  numerical  results 
that  the  observed  count  Af  is  poorly  approximated  by  a.signal  in  additive  Gaussian  noise. 

In  Figure  4  we  compare  the  exact  minimum  error  rate  of  the  CDMA  matched-filter  receiver 
to  those  obtained  with  simplifying  approximations.  For  these-  calculations  we  have  chosen  the  signature 
sequences  from  the  GF(ll)  prime  codes,  which  have  a  length  of  121  chips  and  a  weight  of  11  chips.  We 
have  also  assumed  that  the  average  number  of  thermoelectrons  is  much  less  than  the  average  number 
of  photoelectrons.  As  expected,  the  exact  minimum  error  rate  increases  with  the  number  of  users,  and 
is  a  decreasing  function  of  the  single-user  power.  We  see  that  the  error  rate  curves  seem  to  converge 
to  that  predicted  by  Lemma  3  «s  a  increases.  However,  the  asymptotic  curve  is  a  lower  bound  to  the 
exact  error  rate  by  an  order  of  magnitude  for  optical  powers  less  than  10,000  photons  per  bit.  Also 
note  that  each  exact  error  rate  curve  (constant  s)  approaches  the  asymptotic  curve  corresponding  to 
Gaussian-distributed  MAI  as  the  number  of  users,  K,  increases.  This  fact  may  also  be  justified  by 
Lemma  3.  However,  in  this  case  both  A  and  a\  are  proportional  to  ( K  —  1),  and  the  limiting  value  of  p 
is  finite.  We  conclude  from  Lemma  3  that  the  asymptotic  distribution  of  Af  for  large  K  is  a  mixture  of 
the  random  variable  A  and  an  independent  Gaussian  random  variable.  In  addition,  A  converges  in  law 
to  the  Gaussian  distribution  by  the  central  limit  theorem  as  the  number  of  users  increases.  Therefore, 
for  fixed  single-user  power  and" increasing  number  of  users,  Af  converges  weakly  to  a  Gaussian  random 
variable.  This  fact  is  illustrated  in  Figure  4,  where  the  asymptotic  error  rate  curve  approaches  the  error 
rate  curve  for  the  approximation  of  Gaussian-distributed  MAI.  However,  we  note  that  the  error  rate  is 
unacceptably  high  in  the  region  in  which  the  Gaussian-distributed  MAI  approximation  is  tight.  It  is 
also  evident  from  Figure  4  that,  in  general,  the  Gaussian-distributed  MAI  approximation  yields  a  poor 
estimate  of  the  system  performance. 

Using  the  numerical  results  in  Figure  4  we  may  also  address  the  “error-free”  condition,  an 
artifact  of  perfect  optical-to-electrical  conversion.  Under  this  approximation,  the  observation  is  additively 
composed  of  the  desired  user’s  energy  and  the  MAI.  Since  the  desired  signal  is  on-off  keying,  the  “error- 
free”  condition  exists  when  the  signal  peak  is  greater  than  the  maximum  of  the  MAI.  Since  the  prime 
codes  have  a  cross-correlation  bound  of  2,  this  condition  occurs  when  the  number  of  interferers  is  less 
than  half  the  weight  of  the  sequences.  Therefore,  the  “error-free”  condition  should  occur  for  less  than 
6  users  in  Figure  4.  The  exact  error  rate  curves  in  this  figure  indicate  that  “error-free”  performance 
is  approximated  for  incident  optical  energies  exceeding  10,000  photons  per  bit  -  the  error  rate  for  K=6 
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Figure  4.  Exact  error  rate  curves  vs.  number  of 
users  for  various  optical  energies.  All  users  have 
equal  signal  energies.  Also  shown  are  the  curves 
corresponding  to  the  high  energy  limit,  and 
Gau  Solan-distributed  MAI. 


at  this  energy  is  roughly  10  14.  At  optical  energies  less  than  10,000  photons  per  bit,  the  “error-free” 
phenomenon  is  not  observed. 

An  important  byproduct  of  this  analysis  is  the  optimal  threshold  function  for  the  matched- 
filter  CDMA  receiver.  In  Figure  5  we  have  plotted  the  optimal  thresholds  for  those  error  rate  curves 
plotted  in  Figure  4.  Each  threshold  function  is  normalized  by  the  respective  signal  energy  per  bit.  As  the 
incident  optical  energy  per  bit  increases,  the  normalized  optimal  threshold  increases  to  unity,  which  is 
the  curve  corresponding  to  the  asymptotically  optimal  test.  Note  that  the  combined  approximations  of 
Gaussian-distributed  MAI  and  perfect  optical-to-electrical  conversion  yield  a  threshold  that  significantly 
underestimates  the  exact  optimal  threshold  for  moderate  and  large  received  optical  energies.  For  this 
region  the  high-energy  test  (using  the  exact  MAI  distribution)  yields  a  more  accurate  estimate  of  the 
optimal  threshold.  This  fact  further  illustrates  that  the  observable  is  not  well-modeled  as  a  Gaussian 
random  variable  for  any  optical  power.  Optimal  thresholds  for  large  incident  optical  energies  are  not 
plotted  for  the  “error-free”  region  because  they  could  not  be  reliably  determined  due  to  the  vanishing 
error  rate. 


7.  Conclusions 

In  this  paper  we  have  presented  a  semi-classical  analysis  of  the  error  rate  for  a  noncoherent, 
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Figure  5.  Optimal  thresholds  for  the  matched-filter  CDMA  receiver 


matched-filter  CDMA  receiver  in  the  optical  channel.  The  error  rate  expression  is  valid  for  a  common 
photomultiplier  model,  arbitrary  signature  sequences  and  equal  energies  among  the  users.  In  this  paper 
we  also  developed  arbitrarily-tight  bounds  on  the  error  rate  for  unequal  energies.  In  the  case  when 
the  signal  energies  coincide,  these  bounding  expressions  are  considerably  easier  to  compute  than  the 
exact  error  rate.  The  exact  error  rates  and  optimal  thresholds  were  compared  to  those  obtained  through 
various  approximations  for  the  special  case  of  prime  sequences  and  unity  gain  photodiodes,  and  the 
accuracy  of  these  approximations  was  addressed  for  various  received  optical  energies  and  numbers  of 
transmitters.  It  was  demonstrated  that  the  chip-synchronous  approximation  yielded  minimum  error 
rates  that  upper-bounded  the  exact  error  rates,  that  the  approximation  of  perfect  optical-to-electrical 
conversion  was  accurate  only  when  the  single-user  optical  energy  per  bit  was  much  larger  than  10000 
photons  per  bit,  and  that  the  combined  approximations  of  perfect  optical-to-electrical  conversion  and 
Gaussian-distributed  MAI  were  appropriate  only  for  numbers  of  transmitters  that  yielded  unacceptable 
error  rates  for  a  moderate  number  of  chips  per  bit. 
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